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Chapter-1 



The objective of this chapter is to explain the importance of analysis of algorithms, their notations, relationships and 
solving as many problems as possible. We first concentrate on understanding the basic elements of algorithms, 
importance of analysis and then slowly move towards analyzing the algorithms with different notations and finally the 
problems. After completion of this chapter you should be able to find the complexity of any given algorithm 
(especially recursive functions). 

1.1 Variables 

Before going to the definition of variables, let us relate them to old mathematical equations. All of us have solved 
many mathematical equations since childhood. As an example, consider the below equation: 

x 2 + 2y - 2 = 1 

We don’t have to worry about the use of above equation. The important thing that we need to understand is, the 
equation has some names ( x and y) which hold values (data). That means, the names (x and y) are the place holders 
for representing data. Similarly, in computer science we need something for holding data and variables are the 
facility for doing that. 

1.2 Datatypes 

In the above equation, the variables x and y can take any values like integral numbers (10, 20 etc...), real numbers 
(0.23, 5.5 etc...) or just 0 and 1. To solve the equation, we need to relate them to kind of values they can take and 
data type is the name being used in computer science for this purpose. 

A data type in a programming language is a set of data with values having predefined characteristics. Examples of 
data types are: integer, floating point unit number, character, string etc... 


Computer memory is all filled with zeros and ones. If we have a problem and wanted to code it, it’s very difficult to 
provide the solution in terms of zeros and ones. To help users, programming languages and compilers are providing the 
facility of data types. 

For example, integer takes 2 bytes (actual value depends on compiler), float takes 4 bytes etc... This says that, in 
memory we are combining 2 bytes (16 bits) and calling it as integer. Similarly, combining 4 bytes (32 bits) and 
calling it as float. A data type reduces the coding effort. Basically, at the top level, there are two types of data types: 

• System defined data types (also called Primitive data types) 

• User defined data types 

System defined data types (Primitive data types) 

Data types which are defined by system are called primitive data types. The primitive data types which are provided 
by many programming languages are: int, float, char, double, bool, etc... 


1.1 Variables 
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The number of bits allocated for each primitive data type depends on the programming languages, compiler and 
operating system. For the same primitive data type, different languages may use different sizes. Depending on the size 
of the data types the total available values (domain) will also changes. For example, “ int” may take 2 bytes or 4 bytes. 
If it takes 2 bytes (16 bits) then the total possible values are -32,768 to +32,767 (-2 15 to 2 15 -1). If it takes, 4 bytes (32 
bits), then the possible values are between —2,147,483,648 to +2,147,483,648 (-2 31 to 2 31 -1). Same is the case with 
remaining data types too. 

User defined data types 

If the system defined data types are not enough then most programming languages allows the users to define their own 
data types called as user defined data types. Good example of user defined data types are: structures in C/C + + and 
classes in Java. 

For example, in the below case, we are combining many system defined data types and called it as user defined data 
type with name “newType” . This gives more flexibility and comfort in dealing with computer memory, 
struct newType { 
int datal; 
float data 2; 

char data; 

}; 


1.3 Data Structure 

Based on the above discussion, once we have data in variables, we need some mechanism for manipulating that data to 
solve problems. Data structure is a particular way of storing and organizing data in a computer so that it can be used 
efficiently. That means, a data structure is a specialized format for organizing and storing data. General data 
structure types include arrays, files, linked lists, stacks, queues, trees, graphs and so on. 

Depending on the organization of the elements, data structures are classified into two types: 

1) Linear data structures: Elements are accessed in a sequential order but it is not compulsory to store all 
elements sequentially (say, Linked Lists). Examples: Linked Lists, Stacks and Queues. 

2) Non — linear data structures: Elements of this data structure are stored/accessed in a non-linear order. 
Examples: Trees and graphs. 

1.4 Abstract Data Types (ADTs) 

Before defining abstract data types, let us consider the different view of system defined data types. We all know that, 
by default, all primitive data types (int, float, et..) supports basic operations like addition, subtraction etc... The system 
is providing the implementations for the primitive data types. For user defined data types also we need to define 
operations. The implementation for these operations can be done when we want to actually use them. That means, in 
general user defined data types are defined along with their operations. 

To simplify the process of solving the problems, we generally combine the data structures along with their operations 
and are called Abstract Data Types (ADTs). An ADT consists of two parts: 

1. Declaration of data 

2. Declaration of operations 


1.3 Data Structure 
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Commonly used ADTs include: Linked Lists, Stacks, Queues, Priority Queues, Binary Trees, Dictionaries, Disjoint Sets 
(Union and Find), Hash Tables, Graphs, and many other. For example, stack uses LIFO (Last-In-First-Out) mechanism 
while storing the data in data structures. The last element inserted into the stack is the first element that gets deleted. 
Common operations of it are: creating the stack, pushing an element onto the stack, popping an element from stack, 
finding the current top of the stack, finding number of elements in the stack etc... 

While defining the ADTs do not care about implementation details. They come in to picture only when we want to 
use them. Different kinds of ADTs are suited to different kinds of applications, and some are highly specialized to 
specific tasks. By the end of this book, we will go through many of them and you will be in a position to relate the data 
structures to the kind of problems they solve. 

1.5 What is an Algorithm? 

Let us consider the problem of preparing an omelet. For preparing omelet, general steps we follow are: 

1) Get the frying pan. 

2) Get the oil. 

a. Do we have oil? 

i. If yes, put it in the pan. 

ii. If no, do we want to buy oil? 

1. If yes, then go out and buy. 

2. If no, we can terminate. 

3) Turn on the stove, etc... 

What we are doing is, for a given problem (preparing an omelet), giving step by step procedure for solving it. Formal 
definition of an algorithm can be given as: 

An algorithm is the step-by-step instructions to solve a given problem. 

Note: we do not have to prove each step of the algorithm. 

1.6 Why Analysis of Algorithms? 

To go from city “A" to city “B", there can be many ways of accomplishing this: by flight, by bus, by train and also by 
cycle. Depending on the availability and convenience we choose the one which suits us. Similarly, in computer science 
there can be multiple algorithms exist for solving the same problem (for example, sorting problem has many 
algorithms like insertion sort, selection sort, quick sort and many more). Algorithm analysis helps us determining 
which of them is efficient in terms of time and space consumed. 

1.7 Goal of Analysis of Algorithms 

The goal of analysis of algorithms is to compare algorithms (or solutions) mainly in terms of running time but also 
in terms of other factors (e.g., memory, developers effort etc.) 

1.8 What is Running Time Analysis? 


It is the process of determining how processing time increases as the size of the problem (input size) increases. Input 
size is number of elements in the input and depending on the problem type the input may be of different types. In 
general, we encounter the following types of inputs. 

• Size of an array 


1.5 What is an Algorithm? 
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• Polynomial degree 

• Number of elements in a matrix 

• Number of bits in binary representation of the input 

• Vertices and edges in a graph 

1.9 How to Compare Algorithms? 

To compare algorithms, let us define few objective measures: 

Execution times? Not a good measure as execution times are specific to a particular computer. 

Number of statements executed? Not a good measure, since the number of statements varies with the 
programming language as well as the style of the individual programmer. 

Ideal Solution? Let us assume that we expressed running time of given algorithm as a function of the input size n 
(i.e., f(nj) and compare these different functions corresponding to running times. This kind of comparison is 
independent of machine time, programming style, etc... 


1.10 What is Rate of Growth? 

The rate at which the running time increases as a function of input is called rate of growth. Let us assume that you 
went to a shop for buying a car and a cycle. If your friend sees you there and asks what you are buying then in general 
we say buying a car. This is because, cost of car is too big compared to cost of cycle (approximating the cost of cycle 
to cost of car). 

Total Cost = cost_of_car + cost_of _cycle 
Total Cost ~ cost_of_car ( approximation ) 

For the above example, we can represent the cost of car and cost of cycle in terms of function and for a given function 
ignore the low order terms that are relatively insignificant (for large value of input size, n). As an example in the 
below case, n 4 , 2n 2 , lOOn and 500 are the individual costs of some function and approximate it to n 4 . Since, n 4 is the 
highest rate of growth. 

n 4 + 2n 2 + lOOn + 500 ~ n 4 

1.11 Commonly used Rate of Growths 

Below is the list of rate of growths which come across in remaining chapters. 


Time complexity 

Name 

Example 

1 

Constant 

Adding an element to the front of a finked fist 

logn 

Logarithmic 

Finding an element in a sorted array 

n 

Linear 

Finding an element in an unsorted array 

nlogn 

Linear Logarithmic 

Sorting n items by ‘divide-and-conquer’-Mergesort 

n 2 


Shortest path between two nodes in a graph 

n 3 

Cubic 

Matrix Multiplication 

2 n 

Exponential 

The Towers of Hanoi problem 


Below diagram shows the relationship between different rates of growth. 


1.9 How to Compare Algorithms? 
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To analyze the given algorithm we need to know on what inputs the algorithm is taking less time (performing well) 
and on what inputs the algorithm is taking huge time. We have already seen that an algorithm can be represented in 
the form of an expression. That means we represent the algorithm with multiple expressions: one for case where it is 
taking the less time and other for case where it is taking the more time. In general the first case is called the best case 
and second case is called the worst case of the algorithm. To analyze an algorithm we need some kind of syntax and 
that forms the base for asymptotic analysis/notation. There are three types of analysis: 

• Worst case 

o Defines the input for which the algorithm takes huge time. 

o Input is the one for which the algorithm runs the slower. 

• Best case 

o Defines the input for which the algorithm takes lowest time. 


1.12 Types of Analysis 
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o Input is the one for which the algorithm runs the fastest. 

• Average case 

o Provides a prediction about the running time of the algorithm 
o Assumes that the input is random 

Lower Bound <= Average Time <= Upper Bound 

For a given algorithm, we can represent best, worst and average cases in the form of expressions. As an example, let 
/(n) be the function which represents the given algorithm. 

/(n) = n 2 + 500, for worst case 
f(n) = n + lOOn + 500, for best case 

Similarly, for average case too. The expression defines the inputs with which the algorithm takes the average running 
time (or memory). 

1.13 Asymptotic Notation 

Having the expressions for best, average case and worst cases, for all the three cases we need to identify the upper and 
lower bounds. In order to represent these upper and lower bounds we need some kind syntax and that is the subject of 
following discussion. Let us assume that the given algorithm is represented in the form of function /(n). 


1.14 Big-O Notation 


This notation gives the tight upper bound of the given function. Generally, it is represented as f(ri) = 0(g(n)). That 
means, at larger values of n, the upper bound of /(n) is g (n). For example, if /(n) = n 4 + 100n 2 + lOn + 50 is the 
given algorithm, then n 4 is g(n). That means, g(n) gives the maximum rate of growth for /(n) at larger values of n. 

Let us see the O-notation with little more detail. O-notation defined as 0(g (n)) = {f(n): there exist positive 
constants c and n 0 such that 0 < f(n) < cg(n) for alln > n 0 }. g(n) is an asymptotic tight upper bound for /(n). 
Our objective is to give smallest rate of growth g(n) which is greater than or equal to given algorithms rate of 
growth/(n). 


In general, we discard lower values of n. That means the rate of growth at lower values of n is not important. In the 
below figure, n 0 is the point from which we need to consider the rate of growths for a given algorithm. Below n 0 the 
rate of growths could be different. 


Big-0 Visualization 



O (g(n)) is the set of functions with smaller or same order of growth as g(n). For example, 0(n 2 ) includes 
0(1), O(n), O (nlogn) etc.. 


1.13 Asymptotic Notation 
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Note: Analyze the algorithms at larger values of n only. What this means is, below n 0 we do not care for rate of 
growth. 


Big-0 Examples 





Example-1 Find upper bound for f(ri) = 3n + 8 
Solution: 3n + 8 < 4n, for all n > 1 

3n + 8 = O(n) with c = 4 and n 0 = 8 


Example-2 Find upper bound for /(n) = n 2 + 1 
Solution: n 2 + 1 < 2n 2 , for all n > 1 

n 2 + 1 = 0(n 2 ) with c = 2 andn 0 = 1 


Example-3 Find upper bound for f(n) = n 4 -f 100n 2 + 50 
Solution: n 4 + 100n 2 + 50 < 2n 4 , for all n > 11 

n 4 + lOOn 2 + 50 = O(n 4 ) with c = 2 and n 0 = 11 

Example-4 Find upper bound for /(ft) = 2n 3 — 2n 2 
Solution: 2n 3 — 2 n 2 < 2n 3 , for all n > 1 

2n 3 - 2ft 2 = 0(2n 3 ) with c = 2 and n 0 = 1 


Example-5 Find upper bound for f(ri) =n 
Solution: ft < n 2 , for all n > 1 

••• ft = O(ft 2 ) with c = 1 and n 0 = 1 


Example-6 Find upper bound for /(n) = 410 
Solution: 410 < 410, for all ft > 1 

••• 100 = 0(1) with c = 1 and n 0 = 1 


No Uniqueness? 


There are no unique set of values for n 0 and c in proving the asymptotic bounds. Let us consider, lOOn + 5 =0(n 2 ). 
For this function there are multiple n 0 and c values possible. 

Solutionl: 100ft 4- 5 < lOOn + n = 101 n < 101n 2 for all n > 5, n 0 = 5 and c = 101 is a solution. 

Solution2: lOOn + 5 < lOOn + 5n = 105n < 105n 2 for all n > l,n 0 = 1 and c = 105 is also a solution. 


1.14 Big-0 Notation 
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1.15 Omega-fi Notation 

Similar to O discussion, this notation gives the tighter lower bound of the given algorithm and we represent it 
as/(n) = QThat means, at larger values of n, the tighter lower bound of /(n) is g(n). For example, if 
f(n ) = 100n 2 + lOn + 50, g(ri) is Q(n 2 ). 

Rate of Growth 



The H notation can be defined as fl(g(n)) = (f(n): there exist positive constants c and n 0 such that 0 < cg(n) < 
f(n ) for all n > n 0 ). g(n) is an asymptotic tight lower bound for /(n). Our objective is to give largest rate of growth 
g(n) which is less than or equal to given algorithms rate of growth / (n). 

Q Examples 

Example-1 Find lower bound for f(ri) = 5n 2 

Solution: 3 c, n 0 Such that: 0 <cn <5n 2 =>cn <5 n 2 =>c = 1 and n 0 = 1 
5n 2 = Q(n) with c = 1 and n 0 = 1 

Example-2 Prove f(ri) = lOOrt + 5 ^ Q(n 2 ) 

Solution: 3 c, n 0 Such that: 0 <cn 2 <100n + 5 

lOOn + 5 <100n + 5n(Vn>l) = 105n 
cn 2 <105 n=>n(cn - 105) <0 
Since n is positive =>cn- 105 <0 => n <105 /c 
=> Contradiction: n cannot be smaller than a constant 

Example-3 n = f2(2n), n 3 = Q(n 2 ), n = Q (logri) 

1.16 Theta-0 Notation 

This notation decides whether the upper and lower bounds of a given function (algorithm) are same or not. The 
average running time of algorithm is always between lower bound and upper bound. If the upper bound (O) and lower 
bound (Q) gives the same result then © notation will also have the same rate of growth. As an example, let us assume 
that /(n) = lOn + n is the expression. Then, its tight upper bound gin) is O(n). The rate of growth in best case 
is g(n) =0(n). In this case, rate of growths in best case and worst are same. As a result, the average case will also be 
same. For a given function (algorithm), if the rate of growths (bounds) for O and Q are not same then the rate of 
growth 0 case may not be same. 


1.15 Omega-Q Notation 
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Rate of Growth 



Now consider the definition of 0 notation. It is defined as ®(g(ri)) = {/(n): there exist positive constants 
c i* c 2 and n 0 such that 0 < edgin') < f(n) < c 2 g(n ) for all n > n 0 }. g(n) is an asymptotic tight bound for 
/(n). ®(g (n)) is the set of functions with the same order of growth as g(n). 

© Examples 

Example-1 Find 0 bound for /(n) = ^^ 

Solution: Y - T _ 2 < n 2 , for all, n > 1 

2 

Y - f = ®(n 2 ) with q = 1/5, c 2 = 1 and n 0 = 1 
Example-2 Prove n 0(n 2 ) 

Solution: q n 2 <n< c 2 n 2 => only holds for: n < 1/q 
n ^ 0(n 2 ) 

Example-3 Prove 6n 3 * 0(n 2 ) 

Solution: q n 2 s 6n 3 sc 2 n 2 => only holds for: n < c 2 /6 
••• 6n 3 =£ 0(n 2 ) 

Example-4 Prove n + ®(logn) 

Solution: c Jogn < n < c 2 logn => c 2 > ~ , y n > n 0 - Impossible 


Important Notes 

For analysis (best case, worst case and average) we try to give upper bound (O) and lower bound (Q) and average 
running time (0). From the above examples, it should also be clear that, for a given function (algorithm) getting upper 
bound (O) and lower bound (Q) and average running time (0) may not be possible always. For example, if we are 
discussing the best case of an algorithm, then we try to give upper bound (O) and lower bound (Q) and average 
running time (©). In the remaining chapters we generally concentrate on upper bound (O) because knowing lower 
bound (O) of an algorithm is of no practical importance and we use 0 notation if upper bound (O) and lower bound 
(Q) are same. 

1.17 Why is it called Asymptotic Analysis? 

From the above discussion (for all the three notations: worst case, best case and average case), we can easily 
understand that, in every case for a given function /(n) we are trying to find other function g(n) which approximates 
f(n) at higher values of n. That means, g(n) is also a curve which approximates /(n) at higher values of n. In 


1.17 Why is it called Asymptotic Analy sis? 
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mathematics we call such curve as asymptotic curve. In other terms, g(n) is the asymptotic curve for /(n). For this 
reason, we call algorithm analysis as asymptotic analysis. 

1.18 Guidelines for Asymptotic Analysis 

There are some general rules to help us in determining the running time of an algorithm. 

1) Loops: The running time of a loop is, at most, the running time of the statements inside the loop (including tests) 
multiplied by the number of iterations. 

// executes n times 
for (i=l; i<=n; i++) 

m = m + 2; // constant time, c 
Total time = a constant c X n = c n =0(n). 

2) Nested loops: Analyze from inside out. Total running time is the product of the sizes of all the loops. 

//outer loop executed n times 

for (i=l; i<=n; i++) { 

// inner loop executed n times 
for (j=l; j<=n; j++) 

k = k+1; //constant time 

} 

Total time = cxnxn = cn 2 = 0(n 2 ). 

3) Consecutive statements: Add the time complexities of each statement, 
x = x +1; //constant time 

// executed n times 
for (i=l; i<=n; i++) 

m = m + 2; //constant time 
//outer loop executed n times 
for (i=l; i<=n; i++) { 

//inner loop executed n times 
for (j=l; j<=n; j++) 

k = k+1; //constant time 

} 

Total time = c 0 + c t n + c 2 n 2 = 0(n 2 ). 

4) If- then -else statements: Worst-case running time: the test, plus either the then part orthe else part (whichever is 
the larger). 

//test: constant 
if(length() == 0) { 

return false; //then part: constant 

} 

else {// else part: (constant + constant) * n 
for (int n = 0; n < length(); n++) { 

// another if: constant + constant (no else part) 
if(!list[n].equals(otherList.list[n])) 

//constant 
return false; 

} 

1 

Total time = 00 + 0 ! + (c 2 + c 3 ) * n =0(n). ___ 


1.18 Guidelines for Asymptotic Analysis 
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5) Logarithmic complexity: An algorithm is O(logn ) if it takes a constant time to cut the problem size by a fraction 
(usually by V 2 ). As an example let us consider the following program: 


for (i=l; i<=n;) 
i = i*2; 

If we observe carefully, the value of i is doubling every time. Initially i = 1, in next step i = 2, and in subsequent 
steps i — 4,8 and so on. Let us assume that the loop is executing some k times. At k th step 2‘ = n and we come 
out of loop. Taking logarithm on both sides, gives 

log( 2 1 ) = logn 
ilogl = logn 

i = logn //if we assume base-2 

Total time = O (logn). 

Note: Similarly, for the below case also, worst case rate of growth is O (logn). The same discussion holds good for 
decreasing sequence as well. 


for (i=n; i<=l;) 
i = i/2; 

Another example: binary search (finding a word in a dictionary of n pages) 

• Look at the center point in the dictionary 

• Is word towards left or right of center? 

• Repeat process with left or right part of dictionary until the word is found 

1.19 Properties of Notations 


• Transitivity: f(n) = &(g(n)) and g(n) = ®(h(n)) => f(ri) = ®(h(n)). Valid for O and Q as well. 

• Reflexivity: f(n) = ®(f(n)). Valid for O and D. also. 

• Symmetry: /(n) = ®(g(n)) if and only if g(n) = ®(/(n)). 

• Transpose symmetry: f(ri) = 0(g(n)) if and only if g(n) = Q(f(nJ). 

1.20 Commonly used Logarithms and Summations 


Logarithms 

log x v = y log x 
log xy = logx + logy 
log logn = log (logn) 

a l °3b = x L ° B b 

Arithmetic series 


logn = log? 0 
log k n = (logn) k 
l°g^= logx - logy 



2 >= 


1 + 2-1-h n = 


n(n + 1) 
2 


Geometric series 


x n+l — 1 

x k = 1 + X + X 2 ... + x n = - (x + 1 ) 

, „ x-1 K J 

k =0 

Harmonic series 

111 
k = 1 + 2 + - + - al °9n 
k= 1 ___ 
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Other important formulae 

71 

^ log k * nlogn 

k=l 

n 

= ip+ 2 P + - + n p 


»p+i 


p 4- 1 


1.21 Master Theorem for Divide and Conquer 

All divide and conquer algorithms (In detail, we will discuss them in Divide and Conquer chapter) divides the 
problem into subproblems, each of which is part of the original problem, and then perform some additional work to 
compute the final answer. As an example, merge sort algorithm [for details, refer Sorting chapter] operates on two 
subproblems, each of which is half the size of the original and then performs 0(n) additional work for merging. This 
gives the running time equation: 

T(n) = 2T g) + O(n) 

The following theorem can be used to determine the running time of divide and conquer algorithms. For a given 
program (algorithm), first we try to find the recurrence relation for the problem. If the recurrence is of the below form 
then we can directly give the answer without fully solving it. 

If the recurrence is of the form T(n) = aT (p) + ®(n k log p ri), where a > 1, b > 1, k > 0 and p is a real number, 
then: 

1) If a > b k , then T(n) = &(n l °«S) 

2) If a = b k 

a. If p > -1, then T(n) = ®(n l ° 3 blog p+1 n ) 

b. If p = -1, then T'(n) = Q(n log h loglogn) 

c. If p < —1, then T(n ) = @(n iofl *) 

3) If a < b k 

a. If p > 0, then T(n) = ®(n k log p n ) 

b. Ifp < 0,thenT(n) =0(n k ) 


1, 22 Problems on Divide and Conquer Master Theorem 

For each of the following recurrences, give an expression for the runtime T(n ) if the recurrence can be solved with the 
Master Theorem. Otherwise, indicate that the Master Theorem does not apply. 

Problem-1 T(n) = 3 T (n/2) + n 2 

Solution: T(n) = 3 T (n/2) + n 2 => T (n) =@(n 2 ) (Master Theorem Case 3.a) 

Problem-2 T(n) = 4 T (n/2) + n 2 

Solution: T(n) = AT (n/2) + n 2 => T (n) = ®(n 2 logn) (Master Theorem Case 2.a) 

Problem-3 T(n) = T(n/2) + n 2 

Solution: T(n ) = T(n/ 2) + n 2 => 0(n 2 ) (Master Theorem Case 3.a) 

Problem-4 T(n) = 2 n T(n/2) + n n 

Solution: T(n ) = 2 n T(n/2) + n n => Does not apply (a is not constant) 

Problem-5 Tin) = 16T(n/4) + n 


1.21 Master Theorem for Divide and Conquer 
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Solution: T(ri) = 167 (n/4) + n => T(n) = 0(n 2 ) (Master Theorem Case 1) 

Problem-6 T(n) = 27(n/2) + nlogn 

Solution: 7(n) = 27 (n/2) 4- nlogn => T(n ) = Q(nlog 2 n) (Master Theorem Case 2.a) 
Problem-7 T(n) = 2T(n/2) + n/Zo,pn 

Solution: 7(n) = 2T(n/2) + n/logn =>T(n) = ©(nloglogri) (Master Theorem Case 2.b) 
Problem-8 T(n) = 27' (n/4) + n 0,51 

Solution: 7'(n) = 27(n/4) + n 0,51 => 7 (n) = 0(n° 51 ) (Master Theorem Case 3,b) 

Problem-9 7(n) = 0.57(n/2) + 1/n 

Solution: 7(n) = 0.57'(n/2) + 1/n => Does not apply (a < 1) 

Problem-10 T (n) = 6 T (n/3) + n 2 logn 

Solution: 7’(n) = 6T(n/3) + n 2 logn => T(ri) = Q(n 2 logn) (Master Theorem Case 3.a) 
Problem-11 T (n) = 647’(n/8) — n 2 logn 

Solution: 7'(n) = 647’(n/8) — n 2 logn —> Does not apply (function is not positive) 
Problem-12 T (n) = 77’(n/3) + n 2 

Solution: T(n ) = 7Tin/3) + n 2 => T(n) = 0(n 2 ) (Master Theorem Case 3.as) 
Problem-13 T (n) = 47’ (n/2) + logn 

Solution: T(n) = 4T(n/2) + logn => T’(n) = 0(n 2 ) (Master Theorem Case 1) 
Problem-14 T (n) = 167 (n/4) + n! 

Solution: 7(n) = 167 (n/4) + n! => 7(n) = 0(n!) (Master Theorem Case 3.a) 
Problem-15 7 (n) = V27 (n/2) + logn 

Solution: 7(n) = V27 (n/2) + logn => 7(n) = 0(\/n) (Master Theorem Case 1) 
Problem-16 7 (n) = 37 (n/2) + n 

Solution: 7(n) = 37 (n/2) + n => 7(n) = 0(n iOfl3 ) (Master Theorem Case 1) 
Problem-17 7(n) = 37 (n/3) + Vn 

Solution: 7(n) = 37 (n/3) + Vn => 7(n) = 0(n) (Master Theorem Case 1) 

Problem-18 T(n) = 47 (n/2) + cn 

Solution: 7(n) = 47 (n/2) + cn => 7(n) = 0(n 2 ) (Master Theorem Case 1) 

Problem-19 T(n) = 37 (n/4) + nlogn 

Solution: 7(n) = 37 (n/4) + nlogn => T(ri) = ®(nlogn) (Master Theorem Case 3.a) 
Problem-20 7 (n) = 37 (n/3) + n/2 

Solution: 7(n) = 37 (n/3) + n/2 => 7 (n) = ©(nlogn) (Master Theorem Case 2.a) 


1.23 Master Theorem for Subtract and Conquer Recurrences 
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1.23 Master Theorem for Subtract and Conquer Recurrences 

Let T(n) be a function defined on positive n, and having the property 

(c, if n < 1 

T{jl) ~ {aT(n -b) + /(n), if n > 1 

for some constants c, a > 0, b > 0, k > 0, and function f(n). If f(n) is in 0(n k ), then 

( 0(n k ), if a < 1 

T(n) = \o(n k+1 ), if a = 1 

ifa>l 

1.24 Variant of subtraction and conquer master theorem 

The solution to the equation f(n) = T(a n) + T(( 1 - a)n) + pn, where 0 < a < 1 and /S > 0 are constants, is 
O (nlogri). 

1.25 Amortized Analysis 

Amortized analysis refers to determining the time-averaged running time for a sequence of operations. It is different 
from average case analysis, because amortized analysis does not make any assumption about the distribution of the 
data values, whereas average case analysis assumes the data are not "bad" (e.g., some sorting algorithms do well on 
"average" over all input orderings but very badly on certain input orderings). That is, amortized analysis is a worst 
case analysis, but for a sequence of operations, rather than for individual operations. 

The motivation for amortized analysis is to better understand the running time of certain techniques, where standard 
worst case analysis provides an overly pessimistic bound. Amortized analysis generally applies to a method that 
consists of a sequence of operations, where the vast majority of the operations are cheap, but some of the operations 
are expensive. If we can show that the expensive operations are particularly rare we can "charge them" to the cheap 
operations, and only bound the cheap operations. 

The general approach is to assign an artificial cost to each operation in the sequence, such that the total of the artificial 
costs for the sequence of operations bounds total of the real costs for the sequence. This artificial cost is called the 
amortized cost of an operation. In order to analyze the running time, the amortized cost thus is a correct way of 
understanding the overall running time — but note that particular operations can still take longer so it is not a way of 
bounding the running time of any individual operation in the sequence. 

When one event in a sequence affects the cost of later events: 

• One particular task may be expensive. 

• But it ma y leave data structure in a state that next few operations becomes easier. 

Example: Let us consider an array of elements from which we want to find k th smallest element. We can solve this 
problem using sorting. After sorting the given array, we just need to return the k th element from it. Cost of 
performing sort (assuming comparison based sorting algorithm) is O (nlogn). If we perform n such selections then the 
average cost of each selection is O {nlogn/ri) = 0(logn). This clearly indicates that sorting once is reducing the 
complexity of subsequent operations. 

1.26 Problems on Algorithms Analysis 

Note: From the following problems, try to understand the cases which give different complexities 
(O(n), O (logn), O (loglogn) etc...). 

Problem-21 Find the complexity of the below recurrence: 

I . (n)= F(-lWn > 0, 

(1, otherwise 


1.23 Master Theorem for Subtract and Conquer Recurrences 
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Solution: Let us try solving this function with substitution. 
T(n) = 3 T(n - 1) 

T(n) = 3(3 T(n - 2)) = 3 2 T(n - 2) 

T(n ) = 3 2 (3T(n — 3)) 


T(ri) = 3 n T(n - n) = 3 n T(0) = 3 n 

This clearly shows that the complexity of this function is 0(3 n ). 

Note: We can use the Subtraction and Conquer master theorem for this problem. 

Problem-22 Find the complexity of the below recurrence: 

_ |2 Tin - 1) - 1, if n > 0, 
ll, otherwise 

Solution: Let us try solving this function with substitution. 

T(n) = 2T(n - 1) - 1 

T(n) = 2(2T(n - 2) - 1) - 1 = 2 2 T(n - 2) - 2 - 1 

T(n) = 2 2 (2T(n -3)-2-l)-l = 2 3 T(n - 4) - 2 2 - 2 1 - 2° 

T(n) = 2 n T(n - n) - 2”- 1 - 2 n_z - 2 n ~ 3 .... 2 2 - 2 1 - 2° 

T(n) = 2 n - 2 71-1 - 2 n " 2 - 2 n-3 .... 2 2 - 2 1 - 2° 

T(n) =2 n - (2 n - 1) [note: 2 n ~ x + 2 n ' 2 + - + 2° = 2 n ] 

T(n) = 1 

■■•Complexity is 0(1). Note that while the recurrence relation looks exponential the solution to the recurrence relation 
here gives a different result. 

Problem-23 What is the running time of the following function? 
void Function(int n) { 
int i=l, s=l; 
while( s <= n) { 

i++; 
s= s+i; 
printf(“*"); 

} 

} 

Solution: Consider the comments in below function: 
void Function (int n) { 
int i=l, s=l; 

// s is increasing not at rate 1 hut i 
while( s <= n) { 
i++; 
s= s+i; 
printf(“*"); 

} 

) 

We can define the terms ‘s’ according to the relation s f = Sj_ x + i. The value of '£’ increases by one for each iteration. 
The value contained in ‘s' at the i th iteration is the sum of the first T positive integers. If k is the total number of 
iterations taken by the program, then while l oop terminates if: 
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1 + 2 +...+ k = ( 2 +1) > n ==> k = 0(Vn). 

Problem-24 Find the complexity of the function given below, 
void Function(int n) { 
int i, count =0; 
for(i=l; i*i<=n; i++) 
count++; 


Solution: 

void Function(int n) { 
int i, count =0; 
for(i=l; i*i<=n; i++) 
count++; 

1 

In the above function the loop will end, if t 2 < n => T(n) =0(Vn). The reasoning is same as that of Problem-23. 

Problem-25 What is the complexity of the below program: 
void function(int n) { 

int i, j, k , count =0; 
for(i=n/2; i<=n; i++) 

for(j=l; j + n/2<=n; j= j++) 

for(k=l; k<=n; k= k * 2) 
count++; 

} 

Solution: Consider the comments in the following function, 
void function(int n) { 

int i, j, k, count =0; 

//outer loop execute n/2 times 
for(i=n/2; i<=n; i++) 

//Middle loop executes n/2 times 
for(j=l; j + n/2<=n; j= j++) 

//outer loop execute logn times 
for(k=l; k<=n; k= k * 2) 
count++; 

} 

The complexity of the above function is O (n 2 logri). 

Problem-26 What is the complexity of the below program: 
void function(int n) { 

int i, j, k , count =0; 
for(i=n/2; i<=n; i++) 

for(j=l; j<=n; j=2*j) 

for(k=l; k<=n; k= k * 2) 
count++; 

} 

Solution: Consider the comments in the following function, 
void funetion(int n) { 

int i, j, k , count =0; 

//outer loop execute n/2 times 
for(i=n/2; i<=n; i++) 
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//Middle loop executes logn times 
for(j=l; j<=n; j= 2 * j) 

//outer loop execute logn times 
for(k=l; k<=n; k= k*2) 
count++; 

3 

The complexity of the above function is O (nlog 2 n). 

Problem-27 Find the complexity of the below program, 

function/ int n) { 

if(n == 1) return; 
for(int i=l;i<=n;i + + ){ 

for(int j= 1 ; j <= n ; j + + ) { 
printf(“*"); 
break; 

3 

} 

} 

Solution: Consider the comments in the following function, 
function/ int n) { 

//constant time 
if/ n == 1) return; 

//outer loop execute n times 
for/int i=l;i<=n;i + + ){ 

// inner loop executes only time due to break statement, 
for/int j= 1 ; j <= n ; j + + ) { 
printfP"); 
break; 

3 

} 

} 

The complexity of the above function is 0(n). Even though the inner loop is bounded by n, but due to the break 
statement it is executing only once. 

Problem-28 Write a recursive function for the running time T(n ) of the function given below. Prove using the 
iterative method that T(n) = ©(n 3 ). 

function/ int n) { 

if/ n == 1) return; 
for/int i=l;i<=n;i + + ) 

for/int j = 1 ; j <= n ; j + + ) 
printff*"); 

function/ n-3); 

} 

Solution: Consider the comments in below function: 
function (int n) { 

//constant time 
if/ n == 1) return; 

//outer loop execute n times 
for/int i=l;i<=n;i + + ) 

//inner loop executes n times 
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for(int j = 1 ; j <= n ; j + +) 

//constant time 
printf(“*"); 

function/ n-3); 

The recurrence for this code is clearly T(n) — T(n — 3) + cn 2 for some constant c > 0 since each call prints out n 2 
asterisks and calls itself recursively on n - 3. Using the iterative method we get: T(n) = T(n - 3) + cn 2 . Using the 
Subtraction and Conquer master theorem, we get T(ri) = ©(n 3 ). 

Problem-29 Determine © bounds for the recurrence relation: T(n) = 2 T + nlogn. 

Solution: Using Divide and Conquer master theorem, we get OQilog 2 n). 

Problem-30 Determine © bounds for the recurrence: T(n) = T (j) + T (^) + T + n. 

Solution: Substituting in the recurrence equation, we get: T(n) < cl *| + c2 *j+c3*^+cn< k * n , where k 
is a constant. This clearly says ©(n). 

Problem-31 Determine © bounds for the recurrence relation: T(n) = T([ n/2~|) + 7. 

Solution: Using Master Theorem we get ®(logri). 


Problem-32 Prove that the running time of the code below is Cl(logn). 
void Read(int n) { 
int k = 1; 
while/ k < n) 

k = 3*k; 

} 

Solution: The while loop will terminate once the value of 'k' is greater than or equal to the value of 'n'. In each 
iteration the value of ‘k’ is multiplied by 3. If i is the number of iterations, then 'k’ has the value of 3i after i 
iterations. The loop is terminated upon reaching i iterations when 3 b n <-> i > log 3 n, which shows that i = 
O ( logn). 

Problem-33 Solve the following recurrence. 

_(1, ifn= 1 

r(n) = u (n - 1) + n(n -l),ifn>2 

Solution: By iteration: 

T(n) = T(n - 2) + (n - l)(n - 2) + n(n - 1) 


T(n) = T(l) + j] i(i - 1) 

1 = 1 

n n 

T(n) = T(l) + ^ i 2 - ^ i 


n((n + l)(2n + 1) n(n+l) 

T(n) = 1 +- - - - - 

T(n) =©(n 3 ) 

Note: We can use the Subtraction and Conquer master theorem for this problem. 


Problem-34 Consider the following program: 
Fib[n] 

if(n==0) then return 0 

else if(n==l) then return 1 _ 
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else return Fib[n-l]+Fib[n-2] 

Solution: The recurrence relation for running time of this program is: T(n) = T(n — 1) + T(n — 2) + c. Notice T(n) 
has two recurrence calls indicating a binary tree. Each step recursively calls the program for n reduced by 1 and 2, so 
the depth of the recurrence tree is O(n). The number of leaves at depth n is 2 n since this is a full binary tree, and each 
leaf takes at least 0(1) computation for the constant factor. Running time is clearly exponential in n and it is 0(2 n ). 

Problem-35 Running time of following program? 
function(n) { 

for(int i=l;i<=n;i + + ) 

for(int j = 1 ; j <= n ; j+ = i) 
printff * ”) ; 

3 

Solution: Consider the comments in below function: 
function (n) { 

//this loop executes n times 
for(inti = 1; i <= n ; i + +) 

//this loop executes j times with j increase by the rate of i 
for(int j = 1 ; j <= n ; j+ = i ) 
printf( '’*”)» 

) 

In the above code, inner loop executes n/i times for each value of i. Its running time is n X (51f=i n/i) =0 (nlogn). 
Problem-36 What is the complexity of £?=i log i ? 

Solution: Using the logarithmic property, logxy = logx + logy, we can see that this problem is equivalent to 

n 

y logi = log 1 + log 2-\ -1- log n = log( 1 x 2 x ... x n) = log(n\) < log(n n ) < nlogn 

i =1 

This shows that that the time complexity = O (nlogn). 

Problem-37 What is the running time of the following recursive function (specified as a function of the input 
value n)? First write the recurrence formula and then find its complexity. 
function(int n) { 

if(n <= 1) return; 
for (int i=l ; i <= 3; i++ ) 

ftffD; 

} 

Solution: Consider the comments in below function: 
function (int n) { 

//constant time 
if(n <= 1) return; 

//this loop executes with recursive loop of Rvalue 
for (int i=l ; i <= 3; i++ ) 

fcrfl); 

} 

We can assume that for asymptotical analysis k = [k] for every integer k > 1. The recurrence for this code is 
T(n) = 3T(^) + 0(1). Using master theorem, we get T(n) =©(n). 

Problem-38 What is the running time of the following recursive function (specified as a function of the input 
value n)? First write a recurrence formula, and show its solution using induction. 
function(int n) { 
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if(n <= 1) return; 
for (int i=l ; i <= 3 ; i++) 
function (n - 1). 

1 

Solution: Consider the comments in below function: 
function (int n) { 

//constant time 
if(n <= 1) return; 

//this loop executes 3 times with recursive call of n-1 value 
for (int i=l ; i <= 3 ; i++ ) 
function (n-1). 

1 

The if statement requires constant time [0(1)]. With the for loop, we neglect the loop overhead and only count three 
times that the function is called recursively. This implies a time complexity recurrence: 

T(n ) = c, if n < 1; 

= c + 3T(n - 1), if n > 1. 

Using the Subtraction and Conquer master theorem, we get T(n) = ®(3 n ). 

Problem-39 Write a recursion formula for the running time T(n ) of the function whose code is below, 
function (int n) { 

if(n <= 1) return; 
for(int i = 1; i < n; i + +) 
printf(" *"); 
function ( 0.8n ); 

} 

Solution: Consider the comments in below function: 
function (int n) { 

if(n <= 1) return; //constant time 

// this loop executes n times with constant time loop 
for(int i = 1; i < n; i + +) 
printff *"); 

//recursive call with 0.8n 
function (0.8n); 

3 

The recurrence for this piece of code is T(n) = T(. 8n) + O(n) = T(4/5n) + 0(n) — 4/5 T(n) 4- O(n). Applying 
master theorem, we get T (n) = O (n). 

Problem-40 Find the complexity of the recurrence: T(ri) = 2T(Vn) + logn 

Solution: The given recurrence is not in the master theorem form. Let us try to convert this to master theorem format 
by assuming n = 2 m . Applying logarithm on both sides gives, logn = mlogl => m = logn. Now, the given function 
becomes, 

( 77l\ 

2TJ +m. 

To make it simple we assume S(m) = T(2 m ) => S(j) = T( 27) => S(m) = 2S {-) + m. Applying the master theorem 
would result S(m ) = O (mlogm). If we substitute m = logn back, Tin) = SQogri) = O{{logn) loglogn). 

Problem-41 Find the complexity of the recurrence: T(n) = T(-fn) + 1 

Solution: Applying the logic of Problem-40, gives S(m) = 5 (j) + 1. Applying the master theorem would result 
S(m) = O^logm). Substituting m = logn, gives T(n) = S(logn ) =0 (loglogn). 
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Problem-42 Find the complexity of the recurrence: T(ri) = 2 T(yfn) + 1 

Solution: Applying the logic of Problem-40, gives: S(m) = 2S (-j) + 1- Using the master theorem results 5(m) = 
0(m l ° 52 ) = 0(m). Substituting m = logn gives T(n) = 0(logn). 

Problem-43 Find the complexity of the below function, 
int Function (int n) { 

if(n <= 2) return 1; 

else return (Function (floor(sqrt(n))) + 1); 

} 

Solution: Consider the comments in below function: 
int Function (int n) { 

//constant time 

if(n <= 2) return 1; 

else // executes Vn + 1 times 

return (Function (floor(sqrt(n))) + 1); 

} 

For the above code, the recurrence function can be given as: T(n ) = T(y'n) + 1. This is same as that of Problem-41. 

Problem-44 Analyze the running time of the following recursive psuedocode as a function of n. 

void function(int n) { 

if( n < 2 ) return; 

else counter = 0; 

for i = 1 to 8 do 

function (-); 

V’ 

for i =1 to n 3 do 

counter = counter + 1; 

} 

Solution: Consider the comments in below psuedocode and call running time of function(n) as T(n). 
void function(int n) { 

if( n < 2 ) return; //constant time 
else counter = 0; 

// this loop executes 8 times with n value half in every call 
for i = 1 to 8 do 

function (|); 

// this loop executes n 3 times with constant time loop 
for i =1 to n 3 do 

counter = counter + 1; 

} 

T(n) can be defined as follows: 

T(n) = lifn < 2, 

= BTCj) + n 3 + 1 otherwise. 

Using the master theorem gives, T(n) = Q(n l ° 3 ^ logn) = @(n 3 logn). 

Problem-45 Find the complexity of the below psuedocode. 
temp = 1 
repeat 

for i = 1 to n 

temp= temp + 1; 
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until n <= 1 

Solution: Consider the comments in below psuedocode: 
temp = 1 //const time 
repeat // this loops executes n times 
for i = 1 to n 

temp= temp+1; 

//recursive call with - value 

n 

n 

until n <= 1 

The recurrence for this function is T(n ) = T(n/ 2) + n. Using master theorem we get, T(n) = O(n). 

Problem-46 Running time of following program? 
function(int n) { 

for(inti = l ;i<=n;i + + ) 

for(int j = l;j<=n;j* = 2) 
printf( “ * " ); 

3 

Solution: Consider the comments in below function: 
function/int n) { 

for(int i = 1; i <= n ; i + +) // this loops executes n times 

// this loops executes logn times from our logarithms guideline 
for(int j = l;j<=n;j* = 2) 
printf( 

1 ^ 

Complexity of above program is : O (nlogri). 

Problem-47 Running time of following program? 

function(int n) { 

for(int i = 1 ; i <= n/3 ; i + + ) 

for(int j = 1 ; j <= n ; j += 4 ) 
printf( " * " 

3 

Solution: Consider the comments in below function: 

function/int n) [ // this loops executes n/3 times 
for(int i = 1 ; i <= n/3 ; i + +) 

// this loops executes n/4 times 
for(int j = 1 ; j <= n ; j += 4) 
printf( 

3 

The time complexity of this program is : 0(n 2 ). 

Problem-48 Find the complexity of the below function, 

void function(int n) { 
if(n <= 1) return; 
if(n > 1) { 

printf (" * "); 
function/1); 
function/1); 

} 
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} 

Solution: Consider the comments in below function: 
void function(int n) { 

//constant time 
if(n <= 1) return; 
if(n > 1) { 

//constant time 
printf (" *"); 

//recursion with n/2 value 
function( n/2); 

//recursion with n/2 value 
function( n/2 ); 

1 

1 

The recurrence for this function is: T(n) = 2 T + 1. Using master theorem, we get T(n) = O(n). 

Problem-49 Find the complexity of the below function. 
function(int n) { 
int i=l; 
while (i < n) { 
int j=n; 
while (j > 0) 

j = j/2; 

i=2*i; 

1 // i 

} 

Solution: 

function(int n) { 
int i=l; 
while (i < n) { 
int j=n; 
while(j > 0) 

j = j/2; //logn code 
i=2*i; //logn times 

} // i 

} 

Time Complexity: O (logn * logn) =0 (log 2 n). 

Problem-50 T,i<k<n 0 (n), where O(n) stands for order n is: 

(a) O(n) (b) 0(n 2 ) (c) 0(n 3 ) (d) 0(3n 2 ) (e) 0(1.5n 2 ) 

Solution: (b). S 1£k < n 0(n) = O(n) £ lfikfin 1 =0(n 2 ). 

Problem-51 Which of the following three claims are correct 

I (n + k) m = 0(n m ), where k and m are constants II 2 n+1 = 0(2 n ) III 2 2n+1 = 0(2”) 

(a) I and II (b) I and III (c) II and III (d) I, II and III 

Solution: (a). (I) (n 4- k) m = n k + cUn* -1 + ... k m = &(n k ) and (II) 2 n+1 = 2*2” = 0(2”) 

Problem-52 Consider the following functions: 

f(n) =2” g(n) = n! h(n) =n l09n 

_ Which of the following statements about the asymptotic behaviour of f(n), g(n), and h(n) is true? 
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(A) f(n) = 0(g(n)); g(n) = 0(h(n)) (B) f(n) = O (g(n)); g(n) = 0(h(n)) 

(C) g(n) = 0(f(n)); h(n) = 0(f(n)) (D) h(n) = 0(f(n)); g(n) - Q (f(n)) 

Solution: (D). According to rate of growths: h(n) < f(n) < g(n) (g(n) is asymptotically greater than f(n) and f(n) is 
asymptotically greater than h(n)). We can easily see above order by taking logarithms of the given 3 functions: 
lognlogn <n< log(nl). Note that, log(n\) = O (nlogn). 

Problem-53 Consider the following segment of C-code: 
int j=l, n; 
while (j <=n) 

j = j* 2 ; 

The number of comparisons made in the execution of the loop for any n > 0 is: 

(A) ceil (log%)+ 1 (B) n (C) ceihfo^) (D) floor (log?) + 1 

Solution: (a). Let us assume that the loop executes k times. After k th step the value of j is 2 k . Taking logarithms on 
both sides gives k = log" ■ Since we are doing one more comparison for exiting from loop, the answer is ceil(log”)+ 1. 

Problem-54 Consider the following C code segment. Let T(n) denotes the number of times the for loop is 
executed by the program on input n. Which of the following is TRUE? 
int IsPrime(int n){ 
for(int i=2;i<=sqrt(n);i++) 
if(n%i == 0) 

{ printf(“Not Prime\n”); return 0; } 

return 1; 

} 

(A) T(n) = O (Vn) and T(n) = Q(Vn) (B) T(n) = O (Vn) and T(n) = Q(l) 

(C) T(n) = O(n) and T(n) = Q(fn) (D) None of the above 

Solution: (B). Big O notation describes the tight upper bound and Big Omega notation describes the tight lower bound 
for an algorithm. The for loop in the question is ran maximum Vn times and minimum 1 time. Therefore, T(n) = 
0(Vn)andT(n) = f2(l). 

Problem-55 In the following C function, let n > m. How many recursive calls are made by this function? 
int gcd(n,m){ 

if (n%m ==0) return m; 
n = n%m; 
return gcd(m,n); 

] 

(A )®(log%) (B) Q(n) (C) ®(log 2 log%) (D) @(n) 

Solution: No option is correct. Big O notation describes the tight upper bound and Big Omega notation describes the 
tight lower bound for an algorithm. For m = 2 and for all n = 2 l , running time is 0(1) which contradicts every option. 

Problem-56 Suppose T(ri) = 2T(n/2) 4- n, T(0)=T(1)=1. Which one of the following is FALSE? 

(A) T(n) = 0(n 2 ) (B) T(n) = ®(nlogn) (C) T(n) = Q(n 2 ) (D) T(n) = O(nlogn) 

Solution: (C). Big O notation describes the tight upper bound and Big Omega notation describes the tight lower bound 
for an algorithm. Based on master theorem, we get T(n) = ®{nlogn). This indicates that tight lower bound and tight 
upper bound are same. That means, O (nlogn) and Q (nlogn) are correct for given recurrence. So option (C) is wrong. 
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Chapter 2 ANALYSIS OF ALGORITHMS 


Introduction 

The objective of this chapter is to explain the importance of analysis of algorithms, their notations, 
relationships and solving as many problems as possible. We first concentrate on understanding the 
importance of analysis and then slowly move towards analyzing the algorithms with different notations and 
finally, the problems. After completion of this chapter you should be able to find the complexity of any 
given algorithm (especially recursive functions). 

What is an Algorithm? 

Just to understand better, let us consider the problem of preparing an omelet. For preparing omelet, general 
steps which we follow are: 

1) Get the frying pan. 

2) Get the oil. 

a. Do we have oil? 

i. If yes, put it in the pan. 

ii. If no, do we want to buy oil? 

1. If yes, then go out and buy. 

2. If no, we can terminate. 

3) Turn on the stove, etc.. 

What we are doing is, for a given problem (preparing an omelet), giving step by step procedure for solving 
it. Formal definition of an algorithm can be given as: 

An algorithm is the step-by-step instructions to a given problem. 

One important note to remember while writing the algorithms is: we do not have to prove each step of the 
algorithm. 

Why Analysis of Algorithms? 

If we want to go from city "A" to city "B". There can be many ways of doing this: by flight, by bus, by train 
and also by cycle. Depending on the availability and convenience we choose the one which suits us. 
Similarly, in computer science there can be multiple algorithms exist for solving the same problem (for 


57 


Analysis of Algorithms | Introduction 







www.CareerMonk.com 


Data Structures and Algorithms Made Easy 


example, sorting problem has lot of algorithms like insertion sort, selection sort, quick sort and many more). 
Algorithm analysis helps us determining which of them is efficient in terms of time and space consumed. 

Goal of Analysis of Algorithms? 

The goal of analysis of algorithms is to compare algorithms (or solutions) 
time but also in terms of other factors (e.g., memory, developer's effort etc.) 

What is Running Time Analysis? 

It is the process of determining how processing time increases as the size 
increases. Input size is number of elements in the input and depending on the 
be of different types. In general, we encounter the following types of inputs. 

• Size of an array 

• Polynomial degree 

• Number of elements in a matrix 

• Number of bits in binary representation of the input 

• Vertices and edges in a graph 

How to Compare Algorithms? 

To compare algorithms, let us define some objective measures. 

Execution times? Not a good measure as execution times are specific to a particular computer. 

Number of statements executed? Not a good measure since the number of statements varies with the 
programming language as well as the style of the individual programmer. 

Ideal Solution? 

Let us assume that we expressed running time of given algorithm as a function of the input size n (i.e., 
f(nj). We can compare these different functions corresponding to running times and this kind of 
comparison is independent of machine time, programming style, etc.. 

What is Rate of Growth? 

The rate at which the running time increases as a function of input is called rate of growth. Let us assume 
that you went to a shop for buying a car and a cycle. If your friend sees you there and asks what you are 
buying then in general we say buying a car. This is because cost of car is too big compared to cost of cycle ( 
approximating the cost of cycle to cost of car). 

Total Cost = cost_of_car + cost_of_cycle 
Total Cost ~ cost_of_car (approximation) 


mainly in terms of running 


of the problem (input size) 
problem type the input may 
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For the above example, we can represent the cost of car and cost of cycle in terms of function and for a 
given function we ignore the low order terms that are relatively insignificant (for large value of input size, 
n). As an example in the below case, n 4 , 2n 2 , lOOn and 500 are the individual costs of some function and 
we approximate it to n 4 . Since, n 4 is the highest rate of growth. 

n 4 + 2n 2 + lOOn + 500 « n 4 


Commonly used Rate of Growths 

Below diagram shows the relationship between different rates of growth. 
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Below is the list of rate of growths which come across in remaining chapters. 


Time complexity 

Name 

Example 

1 

Constant 

Adding an element to the front of a linked list 

logn 

Logarithmic 

Finding an element in a sorted array 

n 

Linear 

Finding an element in an unsorted array 

nlogn 

Linear Logarithmic 

Sorting n items by ‘divide-and-conquer’-Mergesort 

9 

7T 

Quadratic 

Shortest path between two nodes in a graph 

n 3 

Cubic 

Matrix Multiplication 

2” 

Exponential 

The Towers of Hanoi problem 


Types of Analysis 

If we have an algorithm for a problem and want to know on what inputs the algorithm is taking less time 
(performing well) and on what inputs the algorithm is taking huge time. 

We have already seen that an algorithm can be represented in the form of an expression. That means we 
represent the algorithm with multiple expressions: one for case where it is taking the less time and other for 
case where it is taking the more time. In general the first case is called the best case and second case is 
called the worst case for the algorithm. 

To analyze an algorithm we need some kind of syntax and that forms the base for asymptotic 
analysis/notation. There are three types of analysis: 

• Worst case 

o Defines the input for which the algorithm takes huge time, 
o Input is the one for which the algorithm runs the slower. 

• Best case 

o Defines the input for which the algorithm takes lowest time, 
o Input is the one for which the algorithm runs the fastest. 

• Average case 

o Provides a prediction about the running time of the algorithm 
o Assumes that the input is random 

Lower Bound <= Average Time <= Upper Bound 

For a given algorithm, we can represent best case, worst case, and average case analysis in the form of 
expressions. As an example, let f(n) be the function which represents the given algorithm. 

f(n ) = n 2 + 500, for worst case 

f(n ) = n + lOOn + 500 , for best case 
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Similarly, for average case too. The expression defines the inputs with which the algorithm takes the 
average running time (or memory). 

Asymptotic Notation? 

Having the expressions for best case, average case and worst case, for all the three cases we need to identify 
the upper bound, lower bounds. In order to represent these upper bound and lower bounds we need some 
syntax and that is the subject of following discussion. Let us assume that the given algorithm is represented 
in the form of function /(n). 


Big-O Notation 

This notation gives the tight upper bound of the given function. Generally we represent it as/(n) = 
0(g(n)). That means, at larger values of n, the upper bound of f(n ) is g(n). 

For example, if/(n) = n 4 + 100n 2 + lOn + 50 is the given algorithm, then n 4 is g(n). That means g(n) 
gives the maximum rate of growth for / (n) at larger values of n. 


Let us see the 0 —notation with little more detail. 0 —notation defined as 0(g(n)) = { f(n ): there exist 
positive constants c and n 0 such that 0 < f(n ) < cg(n ) for all n > n 0 ). g(n) is an asymptotic tight upper 
bound for f(n). Our objective is to give smallest rate of growth g(n ) which is greater than or equal to given 
algorithms rate of growth /(n). 

In general, we discard lower values of n. That means the rate of growth at lower values of n is not 
important. In the below figure, n 0 is the point from which we need to consider the rate of growths for a 
given algorithm. Below n 0 the rate of growths could be different. 


Big-O Visualization 


Rate of Growth cg(ri) 



0(g(n)) is the set of functions with smaller or same order of growth as g(ri). For example, 0 (n 2 ) includes 
0(1), 0(n), 0(nlogri) etc.. 


Note: Analyze the algorithms at larger values of n only. What this means is, below n 0 we do not care for 
rate of growth. 
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Big-0 Examples 

Example-1 Find upper bound for f(n) = 3n + 8 

Solution: 3n + 8 < 4n, for all n > 1 

3n + 8 = 0(n) with c = 4 and n 0 — 8 

Example-2 Find upper bound for f(n) = n 2 + 1 

Solution: n 2 + 1 < 2n 2 , for all n > 1 

n 2 + 1 = 0(n 2 ) with c = 2 and n 0 = 1 

Example-3 Find upper bound for f(n ) = n 4 + 10 On 2 +50 

Solution: n 4 + 100n 2 + 50 < 2n 4 , for all n > 1 

n 4 + 100n 2 + 50 = 0(n 4 ) with c = 2 and n 0 = 100 

Example-4 Find upper bound for f(n) = 2 n 3 — 2n 2 

Solution: 2n 3 — 2n 2 < 2n 3 , for all n > 1 

••• 2n 3 — 2n 2 = 0(2n 3 ) with c = 2 and n 0 = 1 

Example-5 Find upper bound for f(n ) = n 

Solution: n < n 2 , for all n > 1 

n = 0 (n 2 ) with c — 1 and n 0 = 1 

Example-6 Find upper bound for f(n ) = 410 

Solution: 410 < 410, for all n > 1 

100 = 0(1 ) with c = 1 and n 0 = 1 
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No Uniqueness? 

There is no unique set of values for n 0 and c in proving the asymptotic bounds. Let us consider, 10 On + 
5 = 0(n 2 ). For this function there are multiple n 0 and c values possible. 

Solutionl: lOOn + 5 < lOOn + n = lOln < 101n 2 foralln > 5, n 0 = 5 and c = 101 is a solution. 

Solution2: lOOn + 5 < lOOn + 5n = 105n < 105n 2 for all n > l,n 0 = 1 and c = 105 is also a 
solution. 


Omega-Q Notation 

Similar to 0 discussion, this notation gives the tighter lower bound of the given algorithm and we represent 
it as f(n) = Q(g(n)). That means, at larger values of n, the tighter lower bound of /(n) is g(n). 

For example, if f(n) — lOOn 2 + lOn + 50, g(n) is Q(n 2 ). 

The ft notation can be defined as ft(g(n)) = (f(n): there exist positive constants c and n 0 such that 
0 < cg(n ) < /(n) for all n > n 0 }. g(n) is an asymptotic tight lower bound for f(n). Our objective is to 
give largest rate of growth g(n ) which is less than or equal to given algorithms rate of growth / (n). 


Rate of Growth 



Q Examples 

Example-1 Find lower bound for/(n) = 5 n 2 

Solution: 3 c, n 0 Such that: 0 <cn <5n 2 =>cn <5 n 2 => c = 1 and n 0 = 1 
5n 2 = Q(n) with c — 1 and n 0 = 1 


Example-2 Prove/(n) = lOOn + 5 ^ Q(n 2 ) 

Solution: 3 c, n 0 Such that: 0 <cn 2 <100n + 5 

lOOn + 5 <100n + 5n ( Vn >1) = 105n 
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cn 2 <105n =>n(cn - 105) <0 
Since n is positive =>cn- 105 <0 =>n <105/c 
=> Contradiction: n cannot be smaller than a constant 

Example-3 n = Q(2 n), n 3 = Q(n 2 ), n = Q.(logri) 

Theta-0 Notation 

This notation decides whether the upper and lower bounds of a given function (algorithm) are same or not. 
The average running time of algorithm is always between lower bound and upper bound. If the upper 
bound (0) and lower bound (Q) gives the same result then 0 notation will also have the same rate of 
growth. As an example, let us assume that f(n) = 10 n + n is the expression. Then, its tight upper bound 
g(n ) is 0(n). The rate of growth in best case is g(ii) = 0(n). In this case, rate of growths in best case and 
worst are same. As a result, the average case will also be same. For a given function (algorithm), if the rate 
of growths (bounds) for 0 and Q are not same then the rate of growth 6 case may not be same. 


Rate of Growth 



Now consider the definition of 0 notation. It is defined as @(g(n )) = {/(n): there exist positive constants 
c 1 ,c 2 and n 0 such that 0 < c ± g(n ) < f(n ) < c 2 g(n) for all n > n 0 ). g(n) is an asymptotic tight bound 
for f(n). Q(g(n)) is the set of functions with the same order of growth as g{n). 

© Examples 


Example-1 Find 0 bound for /(n) = ^— j 
Solution: < -— - < n 2 , for all, n > 1 

5 2 2 

^— j = 0 (n 2 ) with c x = 1/5, c x — 1 and n 0 = 1 
Example-2 Prove n ^ B(n 2 ) 

Solution: c x n 2 < n < c 2 n 2 => only holds for: n < 1 /c 1 
n ^ 0(n 2 ) 
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Example-3 Prove 6n 3 ± 0(n 2 ) 

Solution: c x n 2 < 6n 3 < c 2 n 2 => only holds for: n < c 2 /6 
6 n 3 + 0(n 2 ) 

Example-4 Prove n =f= d(logn) 

Solution: c^ogn < n < c 2 logn => c 2 > n o ~ Impossible 

Important Notes 

For analysis (best case, worst case and average) we try to give upper bound (0) and lower bound (Q) and 
average running time (0). From the above examples, it should also be clear that, for a given function 
(algorithm) getting upper bound (0) and lower bound (Q) and average running time (0) may not be possible 
always. For example, if we are discussing the best case of an algorithm, then we try to give upper bound (0) 
and lower bound (Q) and average running time (0). 

In the remaining chapters we generally concentrate on upper bound (0) because knowing lower bound (Q) 
of an algorithm is of no practical importance and we use 0 notation if upper bound (0) and lower bound (Q) 
are same. 

Why is it called Asymptotic Analysis? 

From the above discussion (for all the three notations: worst case, best case and average case), we can easily 
understand that, in every case for a given function f(n ) we are trying to find other function g(n) which 
approximates f(n ) at higher values of n. That means, g(ji) is also a curve which approximates /(n) at 
higher values of n. In mathematics we call such curve as asymptotic curve. In other terms, g(n) is the 
asymptotic curve for f(n). For this reason, we call algorithm analysis as asymptotic analysis. 

Guidelines for Asymptotic Analysis? 

There are some general rules to help us in determining the running time of an algorithm. Below are few of 
them. 

1) Loops: The running time of a loop is, at most, the running time of the statements inside the loop 
(including tests) multiplied by the number of iterations. 

// executes n times 
for (i=l; i<=n; i++) 

[ 

m = m + 2; // constant time, c 

} 
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Total time = a constant c x n = c n = 0(n ). 

2) Nested loops: Analyze from inside out. Total running time is the product of the sizes of all the loops. 

//outer loop executed n times 
for (i=l; i<=n; i++) 

I 

// inner loop executed n times 
for (j=l; j<=n; j++) 

1 

k = k+1; //constant time 

3 

3 

Total time = c x n x n = cn 2 = 0(n 2 ). 

3) Consecutive statements: Add the time complexities of each statement. 

x = x +1; //constant time 

// executed n times 
for (i=l; i<=n; i++) 

1 

m = m + 2; //constant time 

} 

//outer loop executed n times 
for (i=l; i<=n; i++) 

1 

//inner loop executed n times 
for (j=l; j<=n; j++) 

1 

k = k+1; //constant time 

3 

} 


Total time = c 0 + cyn + c 2 n 2 = 0 (n 2 ). 

4) If-then-else statements: Worst-case running time: the test, plus either the then part or the else part 
(whichever is the larger). 

//test: constant 

if (length () != otherStack. length ()) 

1 

return false; //then part: constant 
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} 

else 

I 

// else part: (constant + constant) * n 
for (int n = 0; n < length(); n++) 

1 

// another if: constant + constant (no else part) 
if (!list[n].equals(otherStack.list[n])) 

//constant 
return false; 

3 

3 

Total time = c 0 + ^ + (c 2 + c 3 ) * n = 0(n). 

5) Logarithmic complexity: An algorithm is 0 (logn) if it takes a constant time to cut the problem size by a 
fraction (usually by V 2 ). 

As an example let us consider the following program: 


for (i=l; i<=n;) 

1 

i = i*2; 

} 

If we observe carefully, the value of i is doubling every time. Initially i = 1, in next step i = 2, and in 
subsequent steps i = 4,8 and so on. 

Let us assume that the loop is executing some k times. That means at k th step 2 1 = n and we come out 
of loop. So, if we take logarithm on both sides, 

log( 2‘) = logn 
ilog2 = logn 

i = logn //if we assume base-2 
So the total time = 0(logn). 

Note: Similarly, for the below case also, worst case rate of growth is 0(logn). That means, the same 
discussion holds good for decreasing sequence also. 


for (i=n; i<=l;) 

1 

i = i/2; 

3 
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Another example algorithm is binary search: finding a word in a dictionary of n pages 

• Look at the centre point in the dictionary 

• Is word towards left or right of centre? 

• Repeat process with left or right part of dictionary until the word is found 

Properties of Notations 

• Transitivity: f(n ) = @(g(n )) and g(n) = @(h(n)) => /(n) = @(h(n)). Valid for 0 and Q also. 

• Reflexivity: /(n) = 0(/(n)). Valid for 0 and Q also. 

• Symmetry:/(n) = 0(^(n)) if and only if g(n) = 0(f(n)). 

• Transpose symmetry: f(n) = 0(g(n)) if and only if g(n) = Q(/(n)). 


Commonly used Logarithms and Summations 

Logarithms 

log x y = y log x 
logn = log™ 
log xy = logx + logy 
log k n = (logn) k 
log logn = log (logn) 


log - — logx - logy 


a logi = logt J x = iS£| 
yb iog b a 


Arithmetic series 


2 >= 

K= 1 

trie: 

n 

I 


= 1 + 2 + - + n = 


n(n + 1) 


K= 1 

Geometric series 


„n+1 


1 + X + X 2 ... + X r 


k=1 

Harmonic series 

n 


X — 1 


(X * 1) 


Z 1 1 1 

- = 1 + -+ logi 

k 2 n 

k =1 

Other important formtdae 

n 

yT log k ~ nlogn 


k —1 
n 

I 

fe=i 


fcP= ip+ 2 P + —I- n p 


p + 1 


-p+i 


Master Theorem for Divide and Conquer 
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In all divide and conquer algorithms we divide the problem into subproblems, each of which is some part of 
the original problem, and then perform some additional work to compute the final answer. As an example, 
if we consider merge sort [for details, refer Sorting chapter], it operates on two subproblems, each of 
which is half the size of the original, and then uses 0(n) additional work for merging. This gives the 
running time equation: 


T(ri) = 2 T0 + O(n) 

The following theorem can be used to determine the running time of divide and conquer algorithms. For a 
given program (algorithm), first we try to find the recurrence relation for the problem. If the recurrence is 
of the below form then we directly give the answer without fully solving it. 

If the recurrence is of the form T(n) = aT(^) + 9(n k log v n), where a > 1 , b > 1, k > 0 and p is a real 
number, then: 

1) Ifa> b k , then T(n) = 0(n lO9b ^ 

2) If a = b k 

a. If p > — 1, then T(n) = 9(n l ° 9b log p+1 n) 

b. If p = — 1, then T(n) = 0(n l ° 9b loglogn) 

c. If p < — 1, then T(n) = 9(n l ° 9b ) 

3) If a < b k 

a. lip > 0, then T(n) = 0[n k log v n) 

b. lip < 0, then T(n) = 0(n fc ) 

Problems Divide and Conquer Master Theorem 

For each of the following recurrences, give an expression for the runtime T(n) if the recurrence can be 
solved with the Master Theorem. Otherwise, indicate that the Master Theorem does not apply. 

Problem-1 T(n ) = 3 T (n/2) + n 2 

Solution: T(ji) = 3 T (n/2') + n 2 => T (n) = 0(n 2 ) (Master Theorem Case 3.a) 

Problem-2 T(n) = AT (n/2) + n 2 

Solution: T(n) = AT (n/2) + n 2 => T (n) = 0(n 2 logn) (Master Theorem Case 2.a) 

Problem-3 T(n) = 7 (n/2) + 2" 

Solution: T(n) = T(n/2) + 2” => 0(2”) (Master Theorem Case 3.a) 

Problem-4 T (n) = 2 n T(n/2) + n" 

Solution: T(n) = 2 n T(n/2) + n” => Does not apply (a is not constant) 

Problem-5 T(n) = 16T(n/A) + n 
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Solution: T(ri) = 167 (n/4) + n => T(ri) = 0(n 2 ) (Master Theorem Case 1) 

Problem-6 T(n) - 2T(n/2) + nlogn 

Solution: T(n ) = 2T(n/2)+ nlogn => T(n ) = 0(nlog 2 n) (Master Theorem Case 2.a) 
Problem-7 T(n ) = 27’(n/2) + n/logn 

Solution: T{n) = 2T(n/2) + n/logn —>T(n ) = 0(nloglogn) (Master Theorem Case 2.b) 
Problem-8 7’(n) = 27' (n/4) + n 051 

Solution: 7’(n) = 27’(n/4) + n 0 51 => T (n) = 0(n 0 - 51 ) (Master Theorem Case 3.b) 
Problem-9 T(n ) = 0.57'(n/2) + 1/n 

Solution: 7’(n) = 0.57(n/2) + 1/n => Does not apply (a < 1) 

Problem-10 T (n) = 6T (n/3) + n 2 log n 

Solution: T(n) = 6T(n/3) + n 2 logn —> T(n ) = 0(n 2 logn) (Master Theorem Case 3.a) 
Problem-11 7’(it) = 647’(n/8) — n 2 logn 

Solution: 7'(n) = 647’(n/8) — n 2 logn => Does not apply (function is not positive) 
Problem-12 T (n) = 77’(n/3) + n 2 

Solution: 7’(n) = 77’(n/3) + n 2 => T(n) = 0(n 2 ) (Master Theorem Case 3.as) 
Problem-13 T (n) = 47’ (n/2) + logn 

Solution: 7’(n) = 47’(n/2) + logn => T(n) = 0(n 2 ) (Master Theorem Case 1) 
Problem-14 T (n) = 167’ (n/4) + n! 

Solution: T(n ) = 167’ (n/4) + n! => T(n ) = 0(n!) (Master Theorem Case 3.a) 
Problem-15 T (n) = V2T (n/2) + logn 

Solution: 7’(n) = V27' (n/2) + logn => T(n ) = 0(Vn) (Master Theorem Case 1) 
Problem-16 T (n) = ST 1 (n/2) + n 

Solution: 7’(n) = 37’ (n/2) + n =>T(n ) = 0 (n !ofl3 ) (Master Theorem Case 1) 
Problem-17 T’(n) = ST 1 (n/3) + Vn 

Solution: 7’(n) = 37’(n/3) + Vn => T’(n) = 0(n) (Master Theorem Case 1) 

Problem-18 T(n ) = 47’ (n/2) + cn 

Solution: T(n) = 47’ (n/2) + cn => T(n) = 0 (n 2 ) (Master Theorem Case 1) 
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Problem-19 T(ri) = 3 T (n/ 4) + nlogn 

Solution: T(n ) = 37 (n/4) + nlogn => T(n) = 6(nlogn) (Master Theorem Case 3.a) 

Problem-20 7 (n) = 3 7 (n/3) + n/2 

Solution: 7(n) = 37 (n/3) + n/2 => 7 (n) = 0(nlogn) (Master Theorem Case 2.a) 

Master Theorem for Subtract and Conquer Recurrences 

Let 7(n) be a function defined on positive n, and having the property 

T( . _ f c, ifn < 1 

^ n ' (aT(n — b) + f(n), if n > 1 

for some constants c, a > 0 ,b > 0, k > 0, and function /(n). If/(n) is in 0(n k ), then 

f0(n k ), if a < 1 

T(n) = | 0(n k+1 '), if a — 1 

[o (n k ab^j, if a > 1 

Variant of subtraction and conquer master theorem 

The solution to the equation T(ri) = T(ari) + '/'(('l — a)n) + [in, where 0 < a < 1 and f> > 0 are 
constants, is 0(nlogn). 

Problems on Algorithms Analysis 

Note: From the following problems, try to understand in what cases we get different complexities 
(0(n),0(logn),0(loglogn) etc..). 

Problem-21 Find the complexity of the below recurrence: 

r( - n ) = f 37 X n - 1 )’ i / n>0 ’ 

(1, otherwise 

Solution: Let us try to solve this function with substitution. 

T(n) = 3 T(n - 1) 

T(n) = 3(3 T(n - 2)) = 3 2 T(n - 2) 

T(n) = 3 2 (37(n — 3)) 


T(n) = 3 n T(n - n) = 3 n T(0) 

= 3” 
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This clearly shows that the complexity of this function is 0(3"). 

Note: We can use the Subtraction and Conquer master theorem for this problem. 

Problem-22 Find the complexity of the below recurrence: 

f27(n-l)-l,i/n>0, 
ll, otherwise 

Solution: Let us try to solve this function with substitution. 

T(n) = 2 T(n - 1) - 1 

T(n) = 2(2T(n - 2) - 1) - 1 = 2 2 T(n - 2) - 2 - 1 

r(n) = 2 2 (2T(n -3)-2-l)-l = 2 3 T(n - 4) - 2 2 - 2 1 - 2° 

T(n) = 2 n T(n - n) - 2"- 1 - 2"“ 2 - 2"“ 3 .... 2 2 - 2 1 - 2° 

T(n) = 2" - 2" _1 - 2" -2 - 2" -3 .... 2 2 - 2 1 - 2° 

T{n) = 2" - (2 n - 1) [note: 2"" 1 + 2"“ 2 + + 2° = 2"] 

Tin) = 1 

Complexity is 0(1). Note that while the recurrence relation looks exponential the solution to the 
recurrence relation here gives a different result. 

Problem-23 What is the running time of the following function (specified as a function of the input 
value n) 

void Function(int n) 

I 

int i=l ; 
int s=l ; 
while( s <= n) 

1 

i++; 
s= s+i; 
print(V'); 

3 

3 

Solution: Consider the comments in below function: 
void Function (int n) 

1 

int i=l ; 
int s=l ; 

// s is increasing not at rate 1 but i 
while( s <= n) 

1 

i++; 
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s= s+i; 
print(“*"); 

} 

} 

We can define the terms ‘s’ according to the relation s,= Sj_ j + i. The value of ‘i’ increases by one for each 
iteration. So the value contained in 's' at the i th iteration is the sum of the first ‘i’ positive integers. If k is 
the total number of iterations taken by the program, then the while loop terminates once. 

k(k + 1 ) 

1 + 2 + ... + /c = ---> n => k = O(Vn). 

Problem-24 Find the complexity of the function given below, 
void Function (int n) 

1 

int i, count =0;; 
for(i=l; i*i<=n; i++) 
count++; 

} 

Solution: Consider the comments in below function: 
void Function(int n) 

1 

int i, count =0;; 
for(i=l; i*i<=n; i++) 
count++; 

1 

In the above function the loop will end, if i 2 < n => T(n ) = 0(\fn). The reasoning is same as that of 
Problem-23. 

Problem-25 What is the complexity of the below program: 
void function(int n) 

1 

int i, j, k , count =0; 
for(i=n/2; i<=n; i++) 

for(j=l; j + n/2<=n; j= j++) 

for(k=l; k<=n; k= k * 2) 
count++; 

3 

Solution: Consider the comments in the following function, 
void function(int n) 

1 

int i, j, k , count =0; 
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//outer loop execute n/2 times 
for(i=n/2; i<=n; i++) 

//Middle loop executes n/2 times 
for(j=l; j + n/2<=n; j= j++) 

//outer loop execute logn times 
for(k=l; k<=n; k= k * 2) 
count++; 

} 

The complexity of the above function is O(n 2 logn). 

Problem-26 What is the complexity of the below program: 
void function(int n) 

1 

int i, j, k , count =0; 
for(i=n/2; i<=n; i++) 

for(j=l; j<=n; j=2*j) 

for(k=l; k<=n; k= k * 2) 
count++; 

} 

Solution: Consider the comments in the following function, 
void function(int n) 

1 

int i, j, k , count =0; 

//outer loop execute n/2 times 
for(i=n/2; i<=n; i++) 

//Middle loop executes logn times 
for(j=l; j<=n;j=2*j) 

//outer loop execute logn times 
for(k=l; k<=n; k= k*2) 
count++; 

} 

The complexity of the above function is 0(nlog 2 n). 

Problem-27 Find the complexity of the below program. 
function( int n ) 

1 

if (n == 1) return; 
for( i=l ;i<=n;i + + ) 

1 

for( j= 1 ; j <= n ; j + + ) 

1 

print(“*"); 
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} 


} 


break; 

} 


www.CareerMonk.com 


Solution: Consider the comments in the following function, 
function (int n ) 

1 

//constant time 
if ( n == 1 ) return; 

//outer loop execute n times 
for( i=l ;i<=n;i + + ) 

1 

// inner loop executes only time due to break statement. 
for( j= 1 ; j <= n ; j + + ) 

1 

print(“*") ; 
break; 

3 

} 


The complexity of the above function is 0(ri). Even though the inner loop is bounded by n, but due to the 
break statement it is executing only once. 

Problem-28 Write a recursive function for the running time T(n ) of the function function, whose code 
is below. Prove using the iterative method that T(n) = 0(n 2 ). 
function( int n ) 

1 

if ( n == 1 ) return ; 
for( i=l ;i<=n;i + + ) 

for( j = 1 ; j <= n ; j + + ) 
print(“*") ; 

function( n-3); 

} 

Solution: Consider the comments in below function: 


function (int n) 

1 

//constant time 
if ( n == 1 ) return ; 

//outer loop execute n times 
for( i=l ;i<=n;i + + ) 
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3 


//inner loop executes n times 
for( j = l ; j <= n ; j + + ) 
//constant time 
print/) ; 

function/ n-3); 


The recurrence for this code is clearly T(n) = T(n — 3) + cn 2 for some constant c > 0 since each call 
prints out n 2 asterisks and calls itself recursively on n - 3. Using the iterative method we get: 

T(n) = T(n — 3) + cn 2 

Using the Subtraction and Conquer master theorem, we get T(n) = 6 (n 3 ). 

Problem-29 Determine 0 bounds for the recurrence relation: T(n ) = 27' + nlogn 

Solution: Using Divide and Conquer master theorem, we get 0(n log 2 n). 

Problem-30 Determine 0 bounds for the recurrence: T (n) = T(n/2) + T(n/4) + T(n/ 8) + n 

Solution: Substituting in the recurrence equation, we get: 

n n n 

T(n) < cl * — + c2 * — + c3 * — + cn 

< k * n, where k is a constant. 

Problem-31 Determine 0 bounds for the recurrence relation: T(n) = T([n/2~]) + 7 
Solution: Using Master Theorem we get Q(logn). 

Problem-32 Prove that the running time of the code below is Cl(logri). 

Read/int n); 

1 

int k = 1 ; 
while/ k < n ) 
k=3k; 

} 


Solution: The while loop will terminate once the value of 'k' is greater than or equal to the value of V. 
Since each loop the value of ‘k’ is being multiplied by 3, if i is the number of iterations, then ‘k’ has the 
value of 3i after i iterations. That is the loop is terminated upon reaching i iterations when 3 ! > n <-> i > 
log 3 n, which shows that i = ft ( logn ). 

Problem-33 Solve the following recurrence. 
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T{n) 


inn- 


if n — 

1) + n(n — 1), if n> 


1 

2 


Solution: By iteration: 


Tfn) = Tin — 2 ) + (n — l)(n — 2) + n(n — 1) 


Tin) = Til) + £ £(£ - 1) 

;=i 

n n 

i=l i=l 

n((n + l)(2n + 1) n(n + 1) 
7(n) = i +-g-— 

Tin) = 0(n 3 ) 


Note: We can use the Subtraction and Conquer master theorem for this problem. 


Problem-34 Consider the following program: 
Fibfn] 

if (n==0) then return 0 
else if (n==l) then return 1 
else return Fib[n-l]+Fib[n-2] 


Tin) = Til) + £ 


Solution: The recurrence relation for running time of this program is 

Tin) = Tfn - 1) + Tfn - 2) + c. 

Notice T(n) has two recurrence calls indicating a binary tree. Each step recursively calls the program for n 
reduced by 1 and 2, so the depth of the recurrence tree is 0(n). The number of leaves at depth n is 2" since 
this is a full binary tree, and each leaf takes at least 0(1) computation for the constant factor. Running time 
is clearly exponential in n. 


Problem-35 Running time of following program? 
fimction(n) 

[ 


} 


for( i=l ;i<=n;i + + ) 

for( j = 1 ; j <= n ; j+ = i) 
print( ) ; 


Solution: Consider the comments in below function: 
function (n) 

[ 

//this loop executes n times 
for( i=l ;i<=n;i + + ) 


77 


Analysis of Algorithms | Problems on Algorithms Analysis 








Data Structures and Algorithms Made Easy 


^ www.CareerMonk.com 


//this loop executes j times with j increase by the rate of i 
for( j = 1 ; j <= n ; ]+ = i) 
print( ); 


} 


Its running time is n x (\fn) = 0(n 2 )[since the inner loop is same as that of Problem-23]. 


Problem-36 What is the complexity of 1 l°9 i ? 


Solution: Using the logarithmic property, 

equivalent to 
n 


I 


logi = log 


1 + 


log 2 -I-h log n 


logxy = logx + logy, we can see that this problem is 


= log(l x 2 x ... x n) 

= log(n!) 

< log(n n ) 

< nlogn 

This shows that that the time complexity = 0(nlogn). 


Problem-37 What is the running time of the following recursive function (specified as a function of the 
input value n)? First write the recurrence formula and then find its complexity. 
function(int n) 

1 

if (n <= 1) 

return ; 


for (int i=l ; i <= 3; i++ ) 

} 

Solution: Consider the comments in below function: 
function (int n) 

1 

//constant time 
if (n <= 1) 

return ; 

//this loop executes with recursive loop of | value 
for (int i=l ; i <= 3; i++ ) 

f(T|l); 

3 
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We can assume that for asymptotical analysis k = [k] for every integer k > 1. The recurrence for this 
code is T (n) = 3T(j) + 0(1). 

Using master theorem, we get T (n) = 0 (n). 


Problem-38 What is the running time of the following recursive function (specified as a function of the 
input value ri)7 First write a recurrence formula, and show its solution using induction. 
function(int n) 

1 

if (n <= 1) 

return; 


3 


for (i=l ; i <= 3 ; i++ ) 

function (n - 1). 


Solution: Consider the comments in below function: 


function (int n) 

1 

//constant time 
if (n <= 1) 

return; 

//this loop executes 3 times with recursive call of n-1 value 
for (i=l ; i <= 3 ; i++ ) 

function (n - 1). 

} 

The if statement requires constant time (0(1)). With the for loop, we neglect the loop overhead and 
only count the three times that the function is called recursively. This implies a time complexity 
recurrence: 

T(n) = c,if n < 1; 

= c + 3T(n — 1), if n > 1. 

Now we use repeated substitution to guess at the solution when we substitute k times: 

T(ri) = c + 3T(n - 1) 

Using the Subtraction and Conquer master theorem, we get T (n) = 0 (3"). 

Problem-39 Write a recursion formula for the running time T(n ) of the function /, whose code is 
below. What is the running time of function, as a function of n? 
function (int n) 

[ 

if (n <= 1) 

return; 
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int i = 1 ; 

for(i = 1; i < n; i + +) 
print(“*”); 
function ( 0.8n ); 

} 

Solution: Consider the comments in below function: 

function (int n) 

1 

//constant time 
if (n <= 1) 

return; 

//constant time 
int i = 1 ; 

// this loop executes n times with constant time loop 
for(i = 1; i < n; i + +) 
print(“*”); 

//recursive call with 0.8n 
function ( 0.8n ) ; 

} 

The recurrence for this piece of code is T(n) = T (.8n) + 0 (n) 

T( n ) = T(^n) + 0(n) 

T(n) = ^T(n) + 0(n) 

Applying master theorem, we get T (n) = 0(n). 

Problem-40 Find the complexity of the recurrence: T(n ) = 2T(Vri) + logn 

Solution: The given recurrence is not in the master theorem form. Let try to convert this master theorem 
format. For that let use assume that n — 2 m . 

Applying logarithm on both sides gives, logn = mlog2 => m = logn 
Now, the given function becomes, 

( 771 \ 

2TJ + m. 

To make it simple we assume S(m) = T(2 m ) => 5(y) = T( 2T) S(m) = 2S + ni 

Applying the master theorem would result S(m ) = O(mlogni) 

If we substitute m = logn back, T(n) = S(logri) = 0((logri) loglogri). 

Problem-41 Find the complexity of the recurrence: T(n ) = T(Vn) + 1 
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Solution: We apply the same logic as that of Problem-40 and we get 

S(m) = S (—J + 1 

Applying the master theorem would result S(m ) = O(logm). 

Substituting m — logn, gives T(n) = S(log n) = O(loglogn). 

Problem-42 Find the complexity of the recurrence: T(n) = 2 T (\/n) + 1 

Solution: Applying the logic of Problem-40, gives: 

/7Tl\ 

S(m) = 25 (y) + 1 

Using the master theorem results S(ni) = 0(m l ° a2 ') = 0(m). 

Substituting m = logn gives T(n) = O(logri). 

Problem-43 Find the complexity of the below function, 
int Function (int n) 

1 

if (n <= 2) 

return 1; 

else 

return (Function (floor(sqrt(n))) + 1); 

3 

Solution: Consider the comments in below function: 

int Function (int n) 

1 

//constant time 
if (n <= 2) 

return 1; 

else 

// executes yfn + 1 times 

return (Function (floor(sqrt(n))) + 1); 

} 

For the above function, the recurrence function can be given as: T(n ) = T(\fn) + 1. And, this is same as 
that of Problem-41. 

Problem-44 Analyze the running time of the following recursive procedure as a function of n. void 
function(int n) 

1 

if ( n < 2 ) 

return; 

else 
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} 


counter = 0; 

for i = 1 to 8 do 

function (-); 

v 2' 

for I =1 to n 3 do 

counter = counter + 1; 


Solution: Consider the comments in below function and let us refer to the running time of function (n) as 

T(n). 


void function (int n) 

1 

//constant time 
if ( n < 2 ) 

return; 

else 

counter = 0; 

// this loop executes 8 times with n value half in every call 
for i = 1 to 8 do 

function (-); 

// this loop executes n 3 times with constant time loop 
for I =1 to n 3 do 

counter = counter + 1; 

3 

T (n) can be defined as follows: 

T(n ) = 1 if n < 2, 
n , 

= 8 7(—) + n 3 + 1 otherwise. 

Using the master theorem gives, T(n ) = Q(n l ° 9 * logn) = 0 (n 3 log n). 

Problem-45 Find the complexity of the below function, 
temp = 1 
repeat 

for i = 1 to n 

temp = temp + 1; 
n 

n = 

2 

until n <= 1 

Solution: Consider the comments in below function: 

//const time 
temp = 1 
repeat 
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// this loops executes n times 
for i = 1 to n 

temp = temp + 1; 

//recursive call with - value 

2 

n 

n = 

2 

until n <= 1 


The recurrence for this function is T (n) = T(j) + n. 
Using master theorem, we get, T(n ) = 0(n). 


Problem-46 Running time of following program? 
function(int n) 

1 

for( i=l ;i<=n;i + + ) 


} 


for( j = 1 ; j <= n ; j * = 2 ) 
print/ ) ; 


Solution: Consider the comments in below function: 


function (int n) 

1 


} 


// this loops executes n times 
for( i=l ;i<=n;i + + ) 

// this loops executes logn times from our logarithms 
//guideline 

for( j = 1 ; j <= n ; j * = 2 ) 
print/ ) ; 


Complexity of above program is : O(nlogri). 


Problem-47 Running time of following program? 
function/int n) 

1 

for/ i = 1 ; i <= n/3 ; i + + ) 


} 


for/ j = 1 ; j <= n ; j += 4 ) 
print/ ) ; 


Solution: Consider the comments in below function: 
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} 


// this loops executes n/3 times 
for( i = 1 ; i <= n/3 ; i + + ) 

// this loops executes n/4 times 
for( j = 1 ; j <= n ; j += 4) 
print/ ) ; 


The time complexity of this program is : 0(n 2 ). 


Problem-48 Find the complexity of the below function, 
void function(int n) 

1 

if(n <= 1) 

return; 

if (n > 1) 

1 

print ("*"); 
function/ j ); 
function/1); 

} 

3 


Solution: Consider the comments in below function: 


void function/int n) 

1 

//constant time 
if(n <= 1) 

return; 
if (n > 1) 

1 

//constant time 
print ("*"); 

//recursion with n/2 value 
function/ n/2 ); 

//recursion with n/2 value 
function/ n/2 ); 

} 

} 

The recurrence for this function is: T(n ) — 2 T + 1 
Using master theorem, we get T(n ) = 0(n). 


Problem-49 Find the complexity of the below function. 
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Chapter-3 



3.1 What is a Linked List? 

Linked list is a data structure used for storing collections of data. Linked list has the following properties. 

• Successive elements are connected by pointers 

• Last element points to NULL 

• Can grow or shrink in size during execution of a program 

• Can be made just as long as required (until systems memory exhausts) 

• It does not waste memory space (but takes some extra memory for pointers) 



Head 


3.2 Linked Lists ADT 

The following operations make linked lists an ADT. 

Main Linked Lists Operations 

• Insert: inserts an element into the list 

• Delete: removes and returns the specified position element from the list 
Auxiliary Linked Lists Operations 

• Delete List: removes all elements of the list (disposes the list) 

• Count: returns the number of elements in the list 

• Find n th node from the end of the list etc... 

3.3 Why Linked Lists? 

There are many other data structures which do the same thing as that of linked lists. Before discussing linked lists it is 
important to understand the difference between linked lists and arrays. Both linked lists and arrays are used to store 
collections of data. Since both are used for the same purpose, we need to differentiate the usage of them. That means in 
which cases arrays are suitable and in which cases linked lists are suitable. 

3.4 Arrays Overview 

One memory block is allocated for the entire array to hold the elements of the array. The array elements can be 
accessed in a constant time by using the index of the particular element as the subscript. 


Index 
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Why Constant Time for Accessing Array Elements? 

To access an array element, address of an element is computed as an offset from the base address of the array and one 
multiplication is needed to compute what is supposed to be added to the base address to get the memory address of the 
element. First the size of an element of that data type is calculated and then it is multiplied with the index of the 
element to get the value to be added to the base address. 

This process takes one multiplication and one addition. Since these two operations take constant time, we can say the 
array access can be performed in constant time. 

Advantages of Arrays 

• Simple and easy to use 

• Faster access to the elements (constant access) 

Disadvantages of Arrays 

• Fixed size: The size of the array is static (specify the array size before using it). 

• One block allocation: To allocate the array at the beginning itself, sometimes it may not be possible to get the 
memory for the complete array (if the array size is big). 

• Complex position-based insertion: To insert an element at a given position then we may need to shift the 
existing elements. This will create a position for us to insert the new element at the desired position. If the 
position at which we want to add an element is at the beginning then the shifting operation is more 
expensive. 

Dynamic Arrays 

Dynamic array (also called as growable array, resizable array, dynamic table, or array list) is a random access, 
variable-size list data structure that allows elements to be added or removed. 

One simple way of implementing dynamic arrays is, initially start with some fixed size array. As soon as that array 
becomes full, create the new array of size double than the original array. Similarly, reduce the array size to half if the 
elements in the array are less than half. 

Note: We will see the implementation for dynamic arrays in Stacks, Queues and Hashing chapters. 

Advantages of Linked Lists 

Linked lists have advantages and disadvantages. The advantage of linked lists is that they can be expanded in constant 
time. To create an array we must allocate memory for a certain number of elements. To add more elements to the 
array then we must create a new array and copy the old array into the new array. This can take lot of time. 

We can prevent this by allocating lots of space initially but then you might allocate more than you need and wasting 
memory. With a linked list we can start with space for just one element allocated and add on new elements easily 
without the need to do any copying and reallocating. 

Issues with Linked Lists (Disadvantages) 

There are a number of issues in linked lists. The main disadvantage of linked lists is access time to individual 
elements. Array is random-access, which means it takes 0(1) to access any element in the array. Linked lists takes 
O(n) for access to an element in the list in the worst case. Another advantage of arrays in access time is special 
locality in memory. Arrays are defined as contiguous blocks of memory, and so any array element will be physically 
near its neighbors. This greatly benefits from modern CPU caching methods. 

Although the dynamic allocation of storage is a great advantage, the overhead with storing and retrieving data can 
make a big difference. Sometimes linked lists are hard to manipulate. If the last item is deleted, the last but one must 
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now have its pointer changed to hold a NULL reference. This requires that the list is traversed to find the last but one 
link, and its pointer set to a NULL reference. Finally, linked lists wastes memory in terms of extra reference points. 

3.5 Comparison of Linked Lists with Arrays & Dynamic Arrays 


Parameter 

Linked list 

Array 

Dynamic array 

Indexing 

0(n) 

0(1) 

0(1) 

Insertion/deletion at beginning 

0(1) 

- 

0(n) 

Insertion/deletion at ending 

0(n) 

- 

0(1), if the array is not full 
0(n), if the array is full 

Insertion/deletion in middle 

0(71) 

- 

0(71) 

Wasted space 

0(71) 

0 

0(71) 


3.6 Singly Linked Lists 

Generally "linked list" means a singly linked list. This list consists of a number of nodes in which each node has a next 
pointer to the following element. The link of the last node in the list is NULL which indicates end of the list. 

NULL 

Head 

Following is a type declaration for a finked list of integers: 
struct ListNode { 
int data; 

struct ListNode ‘next; 

}; 

Basic Operations on a List 

• Traversing the fist 

• Inserting an item in the list 

• Deleting an item from the list 

Traversing the Linked List 

Let us assume that the head points to the first node of the fist. To traverse the list we do the following. 

• Follow the pointers. 

• Display the contents of the nodes (or count) as they are traversed. 

• Stop when the next pointer points to NULL. 

NULL 

Head 

The ListLength() function takes a finked fist as input and counts the number of nodes in the fist 
used for printing the fist data with extra print function. 

int ListLength(struct ListNode ‘head) { 
struct ListNode ‘current = head; 

int count = 0; _ 

3.5 Comparison of Linked Lists with Arrays & Dynamic Arrays 


. Below function can be 
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while (current != NULL) { 
count++; 

current = current—s-next; 

3 

return count; 

3 

Time Complexity: O(n), for scanning the complete list of size n. Space Complexity: 0(1), for creating one temporary 
variable. 

Singly Linked List Insertion 

Insertion into a singly-linked list has three cases: 

• Inserting a new node before the head (at the beginning) 

• Inserting a new node after the tail (at the end of the list) 

• Inserting a new node at the middle of the list (random location) 

Note: To insert an element in the linked list at some position p, assume that after inserting the element the position of 
this new node is p. 

Inserting a Node in Singly Linked List at the Beginning 

In this case, a new node is inserted before the current head node. Only one next pointer needs to be modified (new 
node’s next pointer) and it can be done in two steps: 

• Update the next pointer of new node, to point to the current head. 

New node 


data 


15 


40 


NULL 


head 

Update head pointer to point to the new node. 
New node 



Head 

Inserting a Node in Singly Linked List at the Ending 

In this case, we need to modify two next pointers (last nodes next pointer and new nodes next pointer). 
• New nodes next pointer points to NULL. 

NULL 



Head 


• Last nodes next pointer points to the new node. 
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New node 



NULL 


Head 


Inserting a Node in Singly Linked List at the Middle 

Let us assume that we are given a position where we want to insert the new node. In this case also, we need to modify 
two next pointers. 

• If we want to add an element at position 6 then we stop at position 5. That means we traverse 5 nodes and 
insert the new node. For simplicity let us assume that fifth is called position node. New node points to the 
next node of the position where we want to add this node. 

Position node 



NULL 




New node 

Position nodes next pointer now points to the new node. 
Position node 



NULL 


New node 


Let us write the code for all these three cases. We must update the first element pointer in the calling function, not 
just in the called function. For this reason we need to send double pointer. The following code inserts a node in the 
singly linked list. 


void InsertlnLinkedList (struct ListNode “head, int data, int position) { 

int k= 1; 

struct ListNode *p, *q, ‘newNode; 

newNode = (ListNode *) malloc(sizeof(struct ListNode)); 

if(!newNode) { //Always Check for Memory Errors 

printf (“Memory Error”); 
return; 

} 

newNode—>data=data; 

p = ‘head; 

if (position == 1) { // Inserting at the beginning 

newNode—>-next = p; 

‘head = newNode; 

3 
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else { //Traverse the list until position-1 

while ((p != NULL) && (k < position - 1)) { 
k++; 

q = p; 

p = p—mext; 

} 

if(p == NULL) { //Inserting at the end 

q—>next = newNode; 
newNode—>next = NULL; 

} 

else { // In the middle 

q—>next = newNode; 
newNode—>next = p; 

} 

} 

} 


Note: We can implement the three variations of the insert operation separately. 

Time Complexity: 0(n). Since, in the worst we may need to insert the node at end of the list. Space Complexity: 0(1), 
for creating one temporary variable. 


Singly Linked List Deletion 

As similar to insertion here also we have three cases. 

• Deleting the first node 

• Deleting the last node 

• Deleting an intermediate node 


Deleting the First Node in Singly Linked List 

First node (current head node) is removed from the list. It can be done in two steps: 
• Create a temporary node which will point to same node as that of head. 



Head Temp 


• Now, move the head nodes pointer to the next node and dispose the temporary node. 



NULL 


Temp Head 


Deleting the last node in Singly Linked List 

In this case, last node is removed from the list. This operation is a bit trickier than removing the first node, because 
algorithm should find a node, which is previous to the tail first. It can be done in three steps: 
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• Traverse the list and while traversing maintain the previous node address also. By the time we reach the end 
of list, we will have two pointers one pointing to the tail node and other pointing to the node before tail 
node. 



NULL 


• Update previous nodes next pointer with NULL. 


NULL 





NULL 


• Dispose the tail node. 


NULL 



NULL 


Deleting an Intermediate Node in Singly Linked List 

In this case, node to be removed is always located between two nodes. Head and tail links are not updated in this 
case. Such a removal can be done in two steps: 

• As similar to previous case, maintain previous node while traversing the list. Once we found the node to be 
deleted, change the previous nodes next pointer to next pointer of the node to be deleted. 



NULL 


Head 


Previous node Node to be deleted 




Dispose the current node to be deleted. 



NULL 


void DeleteNodeFromLinkedList (struct ListNode **head, int position) { 
int k = 1; 

struct ListNode *p, *q; 
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if(*head == NULL) { 

printf (“List Empty”); 
return; 

] 

p = ‘head; 

imposition == 1) { /* from the beginning V 

p = ‘head; 

‘head = 'head—»next; 
free (p); return; 

] 

else { //Traverse the list until the position from which we want to delete 
while ((p != NULL) && (k < position - 1)) { 
k++; q = p; 
p = p^-next; 

} 

if(p == NULL) /* At the end 7 

printf (“Position does not exist.”); 
else { /* From the middle 7 

q—»next = p—>next; 
free(p); 

3 

] 

} 

Time Complexity; O(n). In the worst we may need to delete the node at the end of the list. Space Complexity: 0(1). 
Since, we are creating only one temporary variable. 

Deleting Singly Linked List 

This works by storing the current node in some temporary variable and freeing the current node. After freeing the 
current node go to next node with temporary variable and repeat this process for all nodes. 

void DeleteLinkedList(struct ListNode “head) { 
struct ListNode ‘auxilaryNode, ‘iterator; 
iterator = ‘head; 
while (iterator) { 

auxilaryNode = iterator—>next; 

free(iterator); 

iterator = auxilaryNode; 

} 

‘head = NULL; // to affect the real head back in the caller. 

} 

Time Complexity; 0(n), for scanning the complete list of size n. Space Complexity: 0(1), for creating one temporary 
variable. 

3.7 Doubly Linked Lists 

The advantage of a doubly linked list (also called two - way linked list) is given a node in the list, we can navigate 
in both directions. A node in a singly linked list cannot be removed unless we have the pointer to its predecessor. But 
in doubly linked list we can delete a node even if we don’t have previous nodes address (since, each node has left 
pointer pointing to previous node and can move backward). The primary disadvantages of doubly linked lists are: 

• Each node requires an extra pointer, requiring more space. 
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• The insertion or deletion of a node takes a bit longer (more pointer operations). 

As similar to singly linked list, let us implement the operations of doubly linked lists. If you understand the singly 
linked list operations then doubly li n k ed list operations are very obvious. Following is a type declaration for a doubly 
Hnked list of integers: 

struct DLLNode { 
int data; 

struct DLLNode ‘next; 
struct DLLNode *prev; 

}; 

Doubly Linked List Insertion 

Insertion into a doubly-bnked list has three cases (same as singly linked list): 

• Inserting a new node before the head. 

• Inserting a new node after the tail (at the end of the list). 

• Inserting a new node at the middle of the list. 

Inserting a Node in Doubly Linked List at the Beginning 

In this case, new node is inserted before the head node. Previous and next pointers need to be modified and it can be 
done in two steps: 

• Update the right pointer of new node to point to the current head node (dotted link in below figure) and also 
make left pointer of new node as NULL. 


New node 



NULL NULL 

• Update head nodes left pointer to point to the new node and make new node as head. 
Head 



T 

NULL 


Inserting a Node in Doubly Linked List at the Ending 


In this case, traverse the list till the end and insert the new node. 

• New node right pointer points to NULL and left pointer points to the end of the list. 

Head List end node New node 



NULL 


NULL NULL 


• Update right of pointer of last node to point to new node. 
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Head 



NULL NULL 


Inserting a Node in Doubly Linked List at the Middle 


As discussed in singly linked lists, traverse the list till the position node and insert the new node. 

• New node right pointer points to the next node of the position node where we want to insert the new node. 
Also, new node left pointer points to the position node. 


NULL Position node 



NULL 


New node 

Position node right pointer points to the new node and the next node of position nodes left pointer points to 
new node. 


NULL Position node 



NULL 


New node 

Now, let us write the code for all these three cases. We must update the first element pointer in the r ailin g function, 
not just in the called function. For this reason we need to send double pointer. The following code inserts a node in 
the doubly linked list. 


void DLLInsert(struct DLLNode “head, int data, int position) { 

int k = 1; 

struct DLLNode *temp, ‘newNode; 

newNode = (struct DLLNode *) malloc(sizeof ( struct DLLNode )); 

if(!newNode) { //Always check for memory errors 

printf (“Memory Error”); return; 

) 

newNode—>data = data; 

imposition == 1) { //Inserting a node at the beginning 

newNode—>next = ‘head; 
newNode—>prev = NULL; 

‘head—»prev = newNode; 

‘head = newNode; 
return; 
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} 

temp = ‘head; 

while ((k < position - 1) && temp—>next!=NULL) { 
temp = temp—mext; 
k++; 

} 

if( temp—>next== NULL) ( //Insertion at the end 

newNode—>next = temp—>next; 
newNode—>prev = temp; 
temp—>next = newNode; 

1 

else { //Insertion in the middle 

newNode->next = temp—»next; 
newNode—»prev = temp; 
temp—>next—>prev = newNode; 
temp—»next = newNode; 

1 

return; 

} 

Time Complexity: 0(n). In the worst we may need to insert the node at the end of the list. Space Complexity: 0(1), 
for creating one temporary variable. 


Doubly Linked List Deletion 

As similar to singly linked list deletion, here also we have three cases: 

• Deleting the first node 

• Deleting the last node 

• Deleting an intermediate node 


Deleting the First Node in Doubly Linked List 

In this case, first node (current head node) is removed from the list. It can be done in two steps: 
• Create a temporary node which will point to same node as that of head. 


NULL 



Head Temp 


• Now, move the head nodes pointer to the next node and change the heads left pointer to NULL. Then, dispose 
the temporary node. 



3.7 Doubly Linked Lists 


50 






Data Structures and Algorithms Made Easy 


Linked Lists 


Deleting the Last Node in Doubly Linked List 

This operation is a bit trickier, than removing the first node, because algorithm should find a node, which is previous 
to the tail first. It can be done in three steps: 

• Traverse the list and while traversing maintain the previous node address also. By the time we reach the end 
of list, we will have two pointers one pointing to the NULL (tail) and other pointing to the node before tail 
node. 


NULL 



NULL 


Head 


Previous node to Tail 


Tail 


• Update tail nodes previous nodes next pointer with NULL. 


NULL 


NULL 



Head 


Previous node to Tail Tail 


NULL 


• Dispose the tail node. 


NULL 



NULL 


Head 


Previous node to Tail 


Tail 


Deleting an Intermediate Node in Doubly Linked List 

In this case, node to be removed is always located between two nodes. Head and tail links are not updated in this 
case. Such a removal can be done in two steps: 

• As similar to previous case, maintain previous node also while traversing the list. Once we found the node to 
be deleted, change the previous nodes next pointer to the next node of the node to be deleted. 



Head Previous node Node to be deleted 


NULL 


• Dispose the current node to be deleted. 
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Head Previous node Node to be deleted 

void DLLDelete(stract DLLNode “head, int position) { 

struct DLLNode *temp, *temp2, temp = ‘head; 

int k = 1; 

iffhead == NULL) { 

printf(“List is empty"); return; 

} 

imposition == 1) { 

‘head = ‘head—>next; 
if(*head != NULL) 

‘head—>prev = NULL; 
free(temp); 
return; 

} 

while((k < position - 1) && temp—>next!=NULL) { 
temp = temp—»next; 
k++; 

} 

if( temp^next== NULL) { //Deletion from end 

temp2 = temp—>prev; 
temp2—+next = NULL; 
free(temp); 

} 

else { temp2 = temp—>prev; 

temp2—>next = temp—+next; 
temp—>next—>prev = temp2; 
free (temp); 

} 

return; 

} 

Time Complexity; O(n), for scanning the complete list of size n. Space Complexity: 0(1), for creating one temporary 
variable. 

3.8 Circular Linked Lists 

In singly linked lists and doubly linked lists the end of lists are indicated with NULL value. But circular linked lists do 
not have ends. While traversing the circular linked lists we should be careful otherwise we will be traversing the list 
infinitely. In circular linked lists each node has a successor. Note that unlike singly linked lists, there is no node with 
NULL pointer in a circularly linked list. In some situations, circular linked lists are useful. 

For example, when several processes are using the same computer resource (CPU) for the same amount of time, and 
we have to assure that no process accesses the resource before all other processes did (round robin algorithm). 
Following is a type declaration for a circular linked list of integers: 

_ typedef struct CLLNode { _____ 
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int data; 

struct ListNode ‘next; 

1 ; 

In circular linked list we access the elements using the head node (similar to head node in singly linked list and 
doubly linked lists). 

Counting Nodes in a Circular List 



Head 


The circular list is accessible through the node marked head. To count the nodes, the list has to be traversed from 
node marked head, with the help of a dummy node current and stop the counting when current reaches the starting 
node head. If the list is empty, head will be NULL, and in that case set count = 0. Otherwise, set the current pointer 
to the first node, and keep on counting till the current pointer reaches the starting node. 

int CircularListLength(struct CLLNode ‘head) { 
struct CLLNode ‘current = head; 
int count = 0; 

if(head == NULL) return 0; 
do { current = current—>next; 

count++; 

} while (current != head); 
return count; 

1 

Time Complexity; 0(n), for scanning the complete list of size n. Space Complexity; 0(1), for creating one temporary 
variable. 

Printing the contents of a circular list 

We assume here that the list is being accessed by its head node. Since all the nodes are arranged in a circular fashion, 
the tail node of the list will be the node next to the head node. Let us assume we want to print the contents of the 
nodes starting with the head node. Print its contents, move to the next node and continue printing till we reach the 
head node again. 



Head 


void PrintCircularListData(struct CLLNode ‘head) { 
struct CLLNode ‘current = head; 
if(head = NULL) return; 
do { printf (“%d”, current—>data); 

current = current—mext; 
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] while (current != head); 

1 

Time Complexity; 0(n), for scanning the complete list of size n. Space Complexity: 0(1), for creating ne temporary 
variable. 


Inserting a Node at the End of a Circular Linked List 

Let us add a node containing data, at the end of a list (circular list) headed by head. The new node will be placed just 
after the tail node (which is the last node of the list), which means it will have to be inserted in between the tail node 
and the first node. 

• Create a new node and initially keep its next pointer points to itself. 



New node 


Update the next pointer of new node with head node and also traverse the list until the tail. That means in 
circular fist we should stop at a node whose next node is head. 



Update the next pointer of previous node to point to new node and we get the fist as shown below. 



Head 


void InsertAtEndlnCLL (struct CLLNode “head, int data) { 
struct CLLNode current = ’head; 

struct CLLNode ‘newNode = (struct node*) (malloc(sizeof(struct CLLNode))); 
if(!newNode) { 

printf(“Memory Error”); return; 

} 

newNode—+data = data; 
while (current—>next != ‘head) 

current = current—>next; 
newNode—>next = newNode; 

_iffhead ==NULL)_ 
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‘head = newNode; 
ese { newNode-+next = ‘head; 

current—>next = newNode; 

} 

3 

Time Complexity: O(n), for scanning the complete list of size n. Space Complexity: 0(1), for creating one temporary 
variable. 

Inserting a Node at Front of a Circular Linked List 

The only difference between inserting a node at the beginning and at the ending is that, after inserting the new node 
we just need to update the pointer. Below are the steps for doing the same. 

• Create a new node and initially keep its next pointer points to itself. 



Head 


• Update the next pointer of new node with head node and also traverse the list until the tail. That means in 



New node 

Update the previous node of head in the list to point to new node. 


4 - 


15 


40 


Head 


data 




New node 

• Make new node as head. 



i 


Head 

void InsertAtBeginlnCLL (struct CLLNode “head, int data) { 
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struct CLLNode ‘current = ‘head; 

struct CLLNode * newNode = (struct node*) (malloc(sizeof(struct CLLNode))); 
if(!newNode) { 

printf(“Memory Error”); return; 

5 

newNode—>data - data; 
while (current—>next != ‘head) 

current = current'—s-next; 
newNode—»next = newNode; 
if(*head ==NULL) ‘head = newNode; 
else { newNode—>next = ‘head; 

current—>next = newNode; 

‘head = newNode; 

1 

Return; 

} 

Time Complexity: O(n), for scanning the complete list of size n. Space Complexity: 0(1), for creating only one 
temporary variable. 

Deleting the Last Node in a Circular List 

The list has to be traversed to reach the last but one node. This has to be named as the tail node, and its next field has 
to point to the first node. Consider the following list. To delete the last node 40, the list has to be traversed till you 
reach 7. The next field of 7 has to be changed to point to 60, and this node must be renamed pTail. 

• Traverse the fist and find the tail node and its previous node. 



• Update the tail nodes previous node next pointer to point to head. 



• Dispose the tail node. 
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void DeleteLastNodeFromCLL (struct CLLNode “head) { 
struct CLLNode ‘temp = ‘head; 
struct CLLNode ‘current = ‘head; 
iffhead = NULL) { 

printf( “List Empty”); 
return; 

1 

while (current—>next != ‘head) { 
temp = current; 
current = current—>next; 

} 

free(current); 

return; 

} 

Time Complexity: O(n), for scanning the complete list of size n. Space Complexity: 0(1), for creating one temporary 
variable. 

Deleting the First Node in a Circular List 

The first node can be deleted by simply replacing the next field of tail node with the next field of the first node. 

• Find the tail node of the linked list by traversing the list. Tail node is the previous node to the head node 
which we want to delete. 



Node to be 
Head deleted 


Previous node to 
deleting node 


Create a temporary which will point to head. Also, update the tail nodes next pointer to point to next node of 
head (as shown below). 



Node to be 
Head deleted 


Previous node to 
deleting node 


• Now, move the head pointer to next node. Create a temporary which will point to head. Also, update the tail 
nodes next pointer to point to next node of head (as shown below). 



Node to be 
deleted 


Previous node to 
deleting node 
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void DeleteFrontNodeFromCLL (struct CLLNode ""head) { 
struct CLLNode *temp = "head; 
struct CLLNode "current = "head; 
if(*head == NULL) { 

printf(“List Empty”); return; 

} 

while (current—>next != "head) 

current = current—>next; 
current—> next = "head—>next; 

"head = "head—♦next; 

free(temp); 

return; 

} 

Time Complexity: O(n), for scanning the complete list of size n. Space Complexity: 0(1), for creating one temporary 
variable. 

Applications of Circular List 

Circular linked lists are used in managing the computing resources of a computer. We can use circular lists for 
implementing stacks and queues. 

3.9 A Memory-Efficient Doubly Linked List 

In conventional implementation, we need to keep a forward pointer to the next item on the list and a backward 
pointer to the previous item. That means, elements in doubly linked list implementations consists of data, a pointer to 
the next node and a pointer to the previous node in the list as shown below. 

Conventional Node Definition 

typedef struct ListNode { 
int data; 

struct ListNode * prev; 
struct ListNode * next; 

}; 

Recently a journal (Sinha) presented an alternative implementation of the doubly linked list ADT, with insertion, 
traversal and deletion operations. This implementation is based on pointer difference. Each node uses only one pointer 
field to traverse the list back and forth. 

New Node Definition 

typedef struct ListNode { 
int data; 

struct ListNode * ptrdiff; 

}; 

The ptrdiff pointer field contains the difference between the pointer to the next node and the pointer to the previous 
node. Pointer difference is calculated by using exclusive-or (©) operation. 

ptrdiff = pointer to previous node © pointer to next node. 

The ptrdiff of the start node (head node) is the © of NULL and next node (next node to head). Similarly, the 
ptrdiff of end node is the © of previous node (previous to end node) and NULL. As an example, consider the 
following linked list. 
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- -►NULL 


In the above example, 

• The next pointer of A is: NULL © B 

• The next pointer of B is: A © C 

• The next pointer of C is: B © D 

• The next pointer of D is: C © NULL 

Why does it work? 

To have answer for this question let us consider the properties of ©: 

X©X = 0 
X©0 = X 

X © Y = Y © X (symmetric) 

(X © Y) © Z = X © (Y © Z) (transitive) 

For the above example, let us assume that we are at C node and want to move to B. We know that Cs ptrdiff is 

defined as B © D. If we want to move to B, performing © on Cs ptrdiff with D would give B. This is due to fact 

that, 

(B © D) © D = B (since, D © D=0) 

Similarly, if we want to move to D, then we have to applying © to Cs ptrdiff with B would give D. 

(B © D) © B = D (since, B © B=0) 

From the above discussion we can see that just by using single pointer, we are able to move back and forth. A 
memory-efficient implementation of a doubly linked list is possible to have without compromising much timing 
efficiency. 

3.10 Problems on Linked Lists 

Problem-1 Implement Stack using Linked List 
Solution: Refer Stacks chapter. 

Problem-2 Find n th node from the end of a Linked List. 

Solution: Brute-Force Method: Start with the first node and count how many nodes are there after that node. If the 
number of nodes are < n — 1 then return saying “fewer number of nodes in the list”. If the number of nodes are 
> n — 1 then go to next node. Continue this until the numbers of nodes after current node are n — 1. 

Time Complexity: 0(n 2 ), for scanning the remaining list (from current node) for each node. Space Complexity: 0(1). 

Problem-3 Can we improve the complexity of Problem-2? 

Solution: Yes, using hash table. As an example consider the following list. 

NULL 

Head 

In this approach, create a hash table whose entries are < position of node, node address >. That means, key is the 
position of the node in the list and value is the address of that node. 




Pointer differences 
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Position in List 

Address of Node 

1 

Address of 5 node 

2 

Address of 1 node 

3 

Address of 17 node 

4 

Address of 4 node 


By the time we traverse the complete list (for creating hash table), we can find the list length. Let us say, the list length 
is M. To find n th from end of finked fist, we can convert this to M-n+ 1 th from the beginning. Since we already 
know the length of the fist, it’s just a matter of returning M-n+ 1 th key value from the hash table. 

Time Complexity: Time for creating the hash table. Therefore, T(m) = O(m). Space Complexity: O(m). Since, we 
need to create a hash table of size m. 

Problem-4 Can we use Problem-3 approach for solving Problem-2 without creating the hash table? 

Solution: Yes. If we observe the Problem-3 solution, what actually we are doing is finding the size of the finked fist. 
That means, we are using hash table to find the size of the finked fist. We can find the length of the finked fist just by 
starting at the head node and traversing the fist. So, we can find the length of the fist without creating the hash table. 
After finding the length, compute M - n + 1 and with one more scan we can get the M - n + 1 th node from the 
beginning. This solution needs two scans: one for finding the length of fist and other for finding M — n+ 1 th node 
from the beginning. 

Time Complexity: Time for finding the length + Time for finding the M- n + 1 th node from the beginning. Therefore, 
T(n = O(n) +0(n) « O(n). Space Complexity: 0(1). Since, no need of creating the hash table. 

Problem-5 Can we solve Problem-2 in one scan? 

Solution: Yes. Efficient Approach: Use two pointers pNthNode and pTemp. Initially, both points to head node of the 
fist. pNthNode starts moving only after pTemp made n moves. From there both moves forward until pTemp reaches 
end of the fist. As a result pNthNode points to n th node from end of the finked fist. 

Note: at any point of time both moves one node at time. 

struct ListNode *NthNodeFromEnd(struct ListNode ‘head , int NthNode) { 
struct ListNode ‘pTemp = NULL, ’pNthNode = NULL; 
int count = 0; 

for (pTemp = head; pTemp!= NULL;) { 
count++; 

if(NthNode - count == 0 ) 
pNthNode = head; 
else if(NthNode - count > 0) 

pNthNode = pNthNode—»pNext; 
pTemp = pTemp—>pNext; 

} 

if(pNthNode) return pNthNode; 
return NULL; 

} 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-6 Check whether the given finked fist is either NULL-terminated or ends in a cycle (cyclic) 

Solution: Brute-Force Approach. As an example consider the following finked fist which has a loop in it. The 
difference between this fist and regular fist is that, in this fist there are two nodes whose next pointers are same. In 
regular singly finked fists (without loop) each nodes next pointer is unique. That means, the repetition of next pointers 
indicates the existence of loop. 
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One simple and brute force way of solving this is, start with the first node and see whether there is any node whose 
next pointer is current nodes address. If there is a node with same address then that indicates that some other node is 
pointing to the current node and we can say loops exists. Continue this process for all the nodes of the linked list. 

Does this method works? As per the algorithm we are checking for the next pointer addresses, but how do we find the 
end of the linked list (otherwise we will end up in infinite loop)? 

Note: If we start with a node in loop, this method may work depending on the size of the loop. 

Problem-7 Can we use hashing technique for solving Problem-6? 

Solution: Yes. Using Hash Tables we can solve this problem. 

Algorithm: 

• Traverse the linked list nodes one by one. 

• Check if the nodes address is there in the hash table or not. 

• If it is already there in the hash table then that indicates that we are visiting the node which was already 
visited. This is possible only if the given linked list has a loop in it. 

• If the address of the node is not there in the hash table then insert that nodes address into the hash table. 

• Continue this process until we reach end of the linked list or we find loop. 

Time Complexity: O(n) for scanning the linked list. Note that we are doing only scan of the input. Space Complexity: 
O(n) for hash table. 

Problem-8 Can we solve the Problem-6 using sorting technique? 

Solution: No. Consider the following algorithm which is based on sorting. And then, we see why this algorithm fails. 
Algorithm: 

• Traverse the finked fist nodes one by one and take all the next pointer values into some array. 

• Sort the array which is having next node pointers. 

• If there is a loop in the linked fist, definitely two nodes next pointers will pointing to the same node. 

• After sorting if there is a loop in the fist, the nodes whose next pointers are same will come adjacent in the 
sorted fist. 

• If there is any such pair exists in the sorted fist then we say the finked fist has loop in it. 

Time Complexity: O (nlogn) for sorting the next pointers array. Space Complexity: O(n) for the next pointers array. 

Problem with above algorithm? The above algorithm works only if we can find the length of the fist. But if the fist is 
having loop then we may end up in infinite loop. Due to this reason the algorithm fails. 

Problem-9 Can we solve the Problem-6 in O(n)? 

Solution: Yes. Efficient Approach (Memory less Approach): This problem was solved by Floyd. The solution is named 
as Floyd cycle finding algorithm. It uses 2 pointers moving at different speeds to walk the linked fist. Once they enter 
the loop they are expected to meet, which denotes that there is a loop. This works because the only way a faster 
moving pointer would point to the same location as a slower moving pointer is, if somehow the entire fist or a part of 
it is circular. 

Think of a tortoise and a hare running on a track. The faster running hare will catch up with the tortoise if they are 
running in a loop. As an example, consider the following example and trace out the Floyd algorithm. From the below 
diagrams we can see that after the final step they are meeting at some point in the loop which may not be the starting 
of the loop. 
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Note: slowPtr (tortoise ) moves one pointer at a time and fastPtr (hare) moves two pointers at a time. 
slowPtr 



fastPtr 



fastPtr 

fastPtr slowPt 



int IsLinkedListContainsLoop(struct ListNode * head) { 
struct ListNode 'slowPtr = head, 'fastPtr = head; 
while(slowPtr && fastPtr) { 

fastPtr = fastPtr—mext; 
if(fastPtr == slowPtr) 


fastPtr 


slowPtr 

fastPtr 
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return 1; 

if(fastPtr == NULL) 
return 0; 

fastPtr = fastPtr—>-next; 
if(fastPtr == slowPtr) 
return 1; 

slowPtr = slowPtr—>next; 

} 

return 0; 

1 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-10 We are given a pointer to the first element of a linked list L. There are two possibilities for L, it either 
ends (snake) or its last element points back to one of the earlier elements in the list (snail). Give an algorithm that 
tests whether a given list L is a snake or a snail. 

Solution: It is same as Problem-6. 

Problem-11 Check whether the given linked list is either NULL-terminated or not. If there is a cycle find the start 

node of the loop. 

Solution: The solution is an extension to the previous solution (Problem-9). After finding the loop in the linked list, 
we initialize the slowPtr to head of the linked list. From that point onwards both slowPtr and fastPtr moves only 
one node at a time. The point at which they meet is the start of the loop. Generally we use this method for removing 
the loops. 

int FindBeginofLoop(struct ListNode * head) { 

struct ListNode ‘slowPtr = head, ‘fastPtr = head; 
int loopExists = 0; 
while(slowPtr && fastPtr) { 

fastPtr = fastPtr—>next; 
if(fastPtr == slowPtr) { 
loopExists = 1; 
break; 

3 

if(fastPtr == NULL) 

loopExists = 0; 
fastPtr = fastPtr—>next; 
if(fastPtr == slowPtr) { 
loopExists = 1; 
break; 

} 

slowPtr = slowPtr—vnext; 

} 

if(loopExists) { 

slowPtr = head; 
while(slowPtr != fastPtr) { 

fastPtr = fastPtr—»next; 
slowPtr = slowPtr—>next; 

] 

return slowPtr; 

} 

return NULL; 
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} 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-12 From the previous discussion and problems we understand that tortoise and hares meeting concludes 
the existence of loop, but how does moving tortoise to beginning of linked list while keeping the hare at meeting 
place, followed by moving both one step at a time make them meet at starting point of cycle? 

Solution: This problem is the heart of number theory. In Floyd cycle finding algorithm, notice that the tortoise and the 
hare will meet when they are n X L, where L is the loop length. Furthermore, the tortoise is at the midpoint between 
the hare and the beginning of the sequence, because of the way they move. Therefore the tortoise is n X L away from 
the beginning of the sequence as well. 

If we move both one step at a time, from the tortoise position and from the start of the sequence, we know that they 
will meet as soon as both are in the loop, since they are n X L, a multiple of the loop length, apart. One of them is 
already in the loop, so we just move the other one in single step until it enters the loop, keeping the other n x L away 
from it at all times. 

Problem-13 In Floyd cycle finding algorithm, does it work if we use the step 2 and 3 instead of 1 and 2? 

Solution: Yes, but the complexity might be more. Trace out some example. 

Problem-14 Check whether the given linked list is either NULL-terminated or not. If there is a cycle find the 
length of the loop. 

Solution: This solution is also an extension to the basic cycle detection problem. After finding the loop in the linked 
list, initialize the slowPtr to fastPtr. slowPtr keeps on moving until it again comes back to fastPtr. While moving 
slowPtr, use a counter variable which increments at the rate of 1. 
int FindLoopLength(struct ListNode * head) { 

struct ListNode ‘slowPtr = head, ‘fastPtr = head; 
int loopExists = 0, counter = 0; 
while(slowPtr && fastPtr) { 

fastPtr = fastPtr—>next; 
if(fastPtr == slowPtr) 
loopExists = 1; 
if(fastPtr == NULL) 

loopExists = 0; 
fastPtr = fastPtr—t-next; 
if(fastPtr == slowPtr) loopExists = 1; 
slowPtr = slowPtr—>next; 

} 

if(loopExists) { 

fastPtr = fastPtr—>next; 
while (slowPtr != fastPtr) { 

fastPtr = fastPtr—>next; 
counter++; 

] 

return counter; 

] 

return 0; //If no loops exists 

1 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-15 Insert a node in a sorted linked list 

Solution: Traverse the list and find a position for the element and insert it. 
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struct ListNode *InsertInSortedList(struct ListNode * head, struct ListNode * newNode) { 
struct ListNode ‘current = head, temp; 
if(!head) return newNode; 

// traverse the list until you find item bigger the new node value 
while (current != NULL && current->data < newNode->data){ 
temp = current; 
current = current—>next; 

} 

//insert the new node before the big item 
newNode—»next = current; 
temp—>next = newNode; 
return head; 

1 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-16 Reverse a singly linked list 

Solution: // iterative version 

struct ListNode *ReverseList(struct ListNode ‘head) { 

struct ListNode ‘temp = NULL, ‘nextNode = NULL; 
while (head) { 

nextNode = head—>next; 
head—mext = temp; 
temp = head; 
head = nextNode; 

•} 

return temp; 

} 

Time Complexity: 0(n). Space Complexity: 0(1). 

Problem-17 Suppose there are two singly linked lists both of which intersect at some point and become a single 
linked list. The head or start pointers of both the lists are known, but the intersecting node is not known. Also, the 
number of nodes in each of the list before they intersect are unknown and both list may have it different. Listl 
may have n nodes before it reaches intersection point and List2 might have m nodes before it reaches intersection 
point where m and n may be m = n,m < norm > n. Give an algorithm for fidning the merging point. 



Solution: Brute-Force Approach: One easy solution is to compare every node pointer in the first list with every other 
node pointer in the second list by which the matching node pointers will lead us to the intersecting node. But, the 
time complexity in this case will O(mn) which will be high. 

Time Complexity: O (mn). Space Complexity: 0(1). 

Problem-18 Can we solve Problem-17 using sorting technique? 

Solution: No. Consider the following algorithm which is based on sorting and see why this algorithm fails. 

Algorithm 

• Take first list node pointers and keep in some array and sort them. 
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• Take second list node pointers and keep in some array and sort them. 

• After sorting, use two indexes: one for first sorted array and other for second sorted array. 

• Start comparing values at the indexes and increment the index whichever is having lower value (increment 
only if the values are not equal). 

• At any point, if we were able to find two indexes whose values are same then that indicates that those two 
nodes are pointing to the same node and we return that node. 

Time Complexity: Time for sorting lists + Time for scanning (for comparing)=0 (mlogm) +0 (nlogri) +0(m + n). We 
need to consider the one which gives the maximum value. Space Complexity: 0(1). 

Problem with the above al g ori thm ? Yes. In the algorithm, we are storing all the node pointers of both the lists and 
sorting. But we are forgetting the fact that, there can be many repeated elements. This is because after the merging 
point all node pointers are same for both the lists. The algorithm works fine only in one case and it is when both lists 
have ending node at their merge point. 

Problem-19 Can we solve Problem-17 using hash tables? 

Solution: Yes. 

Algorithm: 

• Select a list which is having less number of nodes (If we do not know the lengths beforehand then select one 
list randomly). 

• Now, traverse the other list and for each node pointer of this list check whether the same node pointer exists 
in the hash table or not. 

• If there a merge point for the give lists then we will definitely encounter the node pointer in the hash table. 

Time Complexity: Time for creating the hash table + Time for scanning the second list = O(m) + 0(n) (or 0(n) + 
O (m), depends on which list we select for creating the hash table). But in both cases the time complexity is same. 
Space Complexity: 0(n) or O(m). 

Problem-20 Can we use stacks for solving the Problem-17? 

Solution: Yes. 

Algorithm: 

• Create two stacks: one for the first list and one for the second list. 

• Traverse the first list and push all the node address on to the first stack. 

• Traverse the second list and push all the node address on to the second stack. 

• Now both stacks contain the node address of the corresponding lists. 

• Now, compare the top node address of both stacks. 

• If they are same, then pop the top elements from both the stacks and keep in some temporary variable (since 
both node addresses are node, it is enough if we use one temporary variable). 

• Continue this process until top node addresses of the stacks are not same. 

• This point is the one where the fists merge into single fist. 

• Return the value of the temporary variable. 

Time Complexity: 0(m + n), for scanning both the fists. Space Complexity: O (m + n), for creating two stacks for both 
the fists. 

Problem-21 Is there any other way of solving the Problem-17? 

Solution: Yes. Using “finding the first repeating number” approach in an array (for algorithm refer Searching 
chapter). 

Algorithm: 

• Create an array A and keep all the next pointers of both the fists in the array. 

• In the array find the first repeating element in the array [Refer Searching chapter for algorithm]. 

• The first repeating number indicates the merging point of the both fists. 
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Time Complexity: O (m + n). Space Complexity: 0(m + n). 

Problem-22 Can we still think for finding alternative solution for the Problem-17? 

Solution: Yes. By combining sorting and search techniques we can reduce the complexity. 

Algorithm: 

• Create an array A and keep all the next pointers of the first list in the array. 

• Sort these array elements. 

• Then, for each of the second list element, search in the sorted array (let us assume that we are using binary 
search which gives O(logri)). 

• Since we are scanning the second list one by one, the first repeating element which appears in the array is 
nothing but the merging point. 

Time Complexity: Time for sorting + Time for searching = 0(Max(mlogm, nlogri)). Space Complexity: 
O (Max(m, n)). 

Problem-23 Can we improve the complexity for the Problem-17? 

Solution: Yes. 

Efficient Approach: 

• Find lengths (LI and L2) of both list — O(n) + O (m) = 0(max(m, n)). 

• Take the difference d of the lengths — 0(1). 

• Make d steps in longer list — O(d). 

• Step in both lists in parallel until links to next node match — O (min(m, n )). 

• Total time complexity = 0(max(m, n)). 

• Space Complexity = 0(1). 

struct ListNode* FindIntersectingNode(struct ListNode* listl, struct ListNode* list2) { 
int L1=0, L2=0, diff=0; 

struct ListNode *headl = listl, *head2 = list2; 
while(headl != NULL) { 

L1++; 

headl = headl—>next; 

} 

while(head2!= NULL) { 

L2++; 

head2 = head2—*mext; 

} 

diff = LI - L2; 
if(Ll < L2) { 

headl = list2; 
head2 = listl; 
diff = L2 - LI; 

} 

for (int i = 0; i < diff; i++) 

headl = headl —>-next; 

while(headl != NULL && head2 != NULL) { 
if(headl == head2) 

return headl —>data; 
headl= headl—>next; 
head2= head2—>next; 

) 

return NULL; 
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} 

Problem-24 How will you find the middle of the linked list? 

Solution: Brute-Force Approach: For each of the node count how many nodes are there in the list and see whether it is 
the middle. 

Time Complexity: 0(n 2 ). Space Complexity: 0(1). 

Problem-25 Can we improve the complexity of Problem-24? 

Solution: Yes. 

Algorithm: 

• Traverse the list and find the length of the fist. 

• After finding the length, again scan the list and locate n/2 node from the beginning. 

Time Complexity: Time for finding the length of the list + Time for locating middle node = O(n) + 0(n) ~ 0(n). 
Space Complexity: 0(1). 

Problem-26 Can we use hash table for solving Problem-24/’ 

Solution: Yes. The reasoning is same as that of Problem-3. 

Time Complexity: Time for creating the hash table. Therefore, T(n) =0(n). Space Complexity: O(n). Since, we need 
to create a hash table of size n. 

Problem-27 Can we solve Problem-24 just in one scan? 

Solution: Efficient Approach: Use two pointers. Move one pointer at twice the speed of the second. When the first 
pointer reaches end of the list, the second pointer will be pointing to the middle node. 

Note: If the list has even number of nodes, the middle node will be of [n/2\. 

struct ListNode * FindMiddle(struct ListNode ‘head) { 
struct ListNode ‘ptrlx, *ptr2x; 
ptrlx = ptr2x = head; 
int i=0; 

// keep looping until we reach the tail (next will be NULL for the last node) 
while(ptrlx—mext != NULL) { 

i f( i==OH 

ptrlx = ptrlx—>next; //increment only the 1st pointer 
i=l; 

} 

else if( i == 1) { 

ptrlx = ptrlx—>next; //increment both pointers 
ptr2x = ptr2x—>next; 
i = 0; 

} 

1 

return ptr2x; //now return the ptr2 which points to the middle node 

} 

Time Complexity: 0(n). Space Complexity: 0(1). 

Problem-28 How will you display a linked list from the end? 

Solution: Traverse recursively till end of the linked list. While coming back, start printing the elements. 

//This Function will print the linked list from end 
void PrintListFromEnd(struct ListNode ‘head) { 
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if(!head) 

return; 

PrintListFromEnd(head—>next); 
printf(“%d ”,head—Mata); 

] 

Time Complexity: O(n). Space Complexity: O(n)—► for Stack. 

Problem-29 Check whether the given Linked List length is even or odd? 

Solution: Use 2x pointer. Take a pointer which moves at 2x [two nodes at a time]. At the end, if the length is even 
then pointer will be NULL otherwise it will point to last node, 
int IsLinkedListLengthEven(struct ListNode * listHead) { 
while(listHead && listHead—>next) 
listHead = listHead—»next—>next; 
if(!listHead) 

return 0; 
return 1; 

} 

Time Complexity: 0([n/2J) *0(n). Space Complexity: 0(1). 

Problem-30 If the head of a linked list is pointing to kth element, then how will you get the elements before kth 
element? 

Solution: Use Memory Efficient Linked Lists [XOR Linked Lists], 

Problem-31 Given two sorted Linked Lists, we need to merge them into the third list in sorted order. 

Solution: 

struct ListNode *MergeList(struct ListNode *a, struct ListNode *b) { 
struct ListNode ‘result = NULL; 
if(a == NULL) return b; 
if(b = NULL) return a; 
if(a—>data <= b—Mata) { 
result =a; 

result—>next = MergeList(a—mext, b); 

} 

else { 

result =b; 

result—>next = MergeList(b—»next,a); 

} 

return result; 

} 

Time Complexity - 0(n + m), where n and m are lengths of two lists. 

Problem-32 Reverse the linked list in pairs. If you have a linked list that holds 1 —>2—>3—>4—> X, then after 
the function has been called the linked list would hold 2 —> 1 —> 4 —> 3 —> X. 

Solution: //Recursive Version 

void ReversePairRecursive(struct ListNode ‘head) { 
struct ListNode ‘temp; 
if(head ==NULL [| head—>next ==NULL) 

return; //base case for empty or 1 element list 

else { //Reverse first pair 
temp = head—>next; 


3.10 Problems on Linked Lists 


69 


Data Structures and Algorithms Made Easy 


Linked Lists 


head—>next = temp—>next; 
temp—>next = head; 

//Call the method recursively for the rest of the list 
ReversePairRecursive(head—>next); 

1 

3 

/‘Iterative version*/ 

void ReversePairIterative(struct ListNode ‘head) { 

struct ListNode ‘temp, *temp2, ‘current = head; 
while(current != NULL && current—*next != NULL) { 

//Swap the pair 
temp = current—>next; 
temp2 = temp—rnext; 
temp—+next = current; 
current—>next = temp2; 

//Advance the current pointer 
recurrent) 

current = current—>next; 

} 

} 

Time Complexity - O(n). Space Complexity - 0(1). 

Problem-33 Given a binary tree convert it to doubly linked list. 

Solution: Refer Trees chapter. 

Problem-34 How do we sort the Linked Lists? 

Solution: Refer Sorting chapter. 

Problem-35 If we want to concatenate two linked lists which of the following gives 0(1) complexity? 

1) Singly linked lists 2) Doubly linked lists 3) Circular doubly linked lists 

Solution: Circular Doubly Linked Lists. This is because for singly and doubly linked lists, we need to traverse the first 
list till the end and append the second list. But in case of circular doubly linked lists we don’t have to traverse the lists. 

Problem-36 Split a Circular Linked List into two equal parts. If the number of nodes in the list are odd then make 
first list one node extra than second list. 

Solution: 

Algorithm 

• Store the mid and last pointers of the circular linked list using Floyd cycle finding algorithm. 

• Make the second half circular. 

• Make the first half circular. 

• Set head pointers of the two linked fists. 

As an example, consider the following circular fist. 



Head 


After the split, the above fist will look like: 
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/* structure for a node 7 
struct ListNode { 
int data; 

struct ListNode ‘next; 

}; 

void SplitList(struct ListNode ‘head, struct ListNode “headl, struct ListNode **head2) { 
struct ListNode ‘slowPtr = head; 
struct ListNode ‘fastPtr = head; 
if(head == NULL) return; 

/* If there are odd nodes in the circular list then fastPtr—>next becomes 
head and for even nodes fastPtr—>mext—>next becomes head V 
while(fastPtr—vnext != head && fastPtr—»next—>next != head) { 
fastPtr = fastPtr—»next—>next; 
slowPtr = slowPtr—i-next; 

3 

/* If there are even elements in list then move fastPtr 7 
if(fastPtr—>next—>next == head) 
fastPtr = fastPtr—>next; 

/* Set the head pointer of first half*/ 

‘headl = head; 

/* Set the head pointer of second half 7 
if(head—mext != head) 

*head2 = slowPtr—>next; 

I* Make second half circular V 
fastPtr—>next = slowPtr—>next; 

/* Make first half circular V 
slowPtr—>next = head; 

} 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-37 How will you check if the linked list is palindrome or not? 

Solution: 

Algorithm 

1. Get the middle of the linked list. 

2. Reverse the second half of the linked list. 

3. Compare the first half and second half. 

4. Construct the original linked list by reversing the second half again and attaching it back to the Erst half. 
Time Complexity: O(n). Space Complexity: 0(1). 

Problem-38 For a given K value ( K > 0) reverse blocks of K nodes in a list. 

Example: Input: 123456789 10. Output for different K values: 

For If = 2:2 14 3 6 5 8710 9 For K = 3: 3 2 1 6 5 4 9 8 7 10 For K = 4: 4 3 2 1 8 7 6 5 9 10 
Solution: 
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Algorithm: This is an extension of swapping nodes in a linked list. 

1) Check if remaining list has K nodes. 

a. If yes get the pointer of K + 1 th node. 

b. Else return. 

2) Reverse first K nodes. 

3) Set next of last node (after reversal) to K + 1 th node. 

4) Move to K + 1 th node. 

5) Go to step 1. 

6) K — 1 th node of first K nodes becomes the new head if available. Otherwise, we can return the head. 

struct ListNode * GetKPlusOneThNode(int K, struct ListNode ‘head) { 
struct ListNode *Kth; 
int i = 0; 
if(!head) 

return head; 

for (i=0, Kth=head; Kth && (i < K); i++, Kth=Kth—>next); 
if(i==K && Kth!=NULL) 
return Kth; 
return head—>next; 

} 

int HasKnodes(struct ListNode ‘head, int K) { 
int i =0; 

for(i=0; head && (i < K); i++, head=head—>next); 
if(i == K) 

return 1; 

return 0; 

] 

struct ListNode *ReverseBlockOfK-nodesInLinkedList(struct ListNode ‘head, int K) { 
struct ListNode *cur=head, ‘temp, ‘next, newHead; 
int i; 

if(K==0 || K==l) 

return head; 
if(HasKnodes(cur, K-l)) 

newHead = GetKPlusOneThNode(K-l, cur); 
else newHead = head; 
while(cur && HasKnodes(cur, K)) { 

temp = GetKPlusOneThNode(K, cur); 
i=0; 

while (i < K) { 

next = cur—»next; 
cur—>next=temp; 
temp = cur; 
cur = next; 
i++; 

} 

} 

return newHead; 

} 

Problem-39 Is it possible to get 0(1) access time for Linked Lists? 
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Solution: Yes. Create a linked list at the same time keep it in a hash table. For n elements we have to keep all the 
elements into hash table which gives preprocessing time of 0(n). To read any element we require only constant time 
0(1) and to read n elements we require n * 1 unit of time = n units. Hence by using amortized analysis we can say 
that element access can be performed within 0(1) time. 

Time Complexity - 0(1) [Amortized], Space Complexity - O(n) for Hash. 

Problem-40 JosephusCircle: N people have decided to elect a leader by arranging themselves in a circle and 
eliminating every M th person around the circle, closing ranks as each person drops out. Find which person will be 
the last one remaining (with rank 1). 

Solution: Assume the input is a circular linked list with JV nodes and each node has a number (range 1 to N ) associated 
with it. The head node has number 1 as data. 

struct ListNode *GetJosephusPosition(){ 
struct ListNode *p, *q; 

printf("Enter N (number of players): "); scanf("%d", &N); 
printf("Enter M (every M-th payer gets eliminated):"); scanf("%d", &M); 

// Create circular linked list containing all the players: 
p = q = malloc(sizeof(struct node)); 
p—>data = 1; 

for (int i = 2; i <= N; ++i) { 

p—>next = malloc(sizeof(struct node)); 
p = p—>next; 
p—>data = i; 

} 

p—»next = q; // Close the circular linked list by having the last node point to the first. 

// Eliminate every M-th player as long as more than one player remains: 
for (int count = N; count > 1; —count) { 
for (int i = 0; i < M - 1; ++i) 
p = p—>next; 

p—>next = p—>next—>next; // Remove the eiminated player from the circular linked list. 

} 

printf("Last player left standing (Josephus Position) is %d\n.", p—>data); 

} 

Problem-41 Given a linked list consists of data, next pointer and also a random pointer which points to a random 
node of the list. Give an algorithm for cloning the list. 

Solution: We can use the hash table to associate newly created nodes with the instances of node in the given list. 
Algorithm: 

• Scan the original list and for each node X, create a new node Y with data of X, then store the pair ( X, Y) in 
hash table using A as a key. Note that during this scan we set Y—> next and Y—> random to NULL and we 
will fix them in the next scan 

• Now for each node X in the original list we have a copy Y stored in our hash table. We scan again the original 
list and set the pointers buildings the new list 

struct ListNode ‘Clone(struct ListNode *head){ 
struct ListNode *X, *Y; 
struct HashTable *HT = CreateHashTableQ; 

X = head; 

while (X != NULL) { 

Y = (struct ListNode *)malloc(sizeof(struct ListNode *)); 

Y—»data = X—>data; 
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Y—>next = NULL; 

Y—random = NULL; 

HT.insert(X, Y); 

X = X—>next; 

1 

X = head; 

while (X != NULL) { 

// get the node Y corresponding to X from the hash table 

Y = HT.get(X); 

Y—>-next = HT.get(X—►next); 

Y.setRandom = LIT.get(X—^-random); 

X = X—>next; 

} 

// Return the head of the new list, that is the Node Y 
return HT.get(head); 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-42 Can we solve Problem-41 without any extra space? 

Solution: Yes. Follow the comments in below code and traceout. 

void Clone(struct ListNode *head){ 

struct ListNode "temp, *temp2; 

//Stepl: put temp—►random in temp2—►next, so that we can reuse the temp—^random field to point to temp2. 

temp = head; 

while (temp != NULL) { 

temp2 = (struct ListNode *)malloc(sizeof(struct ListNode *)); 

temp2—>data = temp—>data; 

temp2—>next = temp—>random; 

temp—^random = temp2; 

temp = temp—>next; 

} 

//Step2: Setting temp2—+random. temp2—>next is the old copy of the node that 
// temp2—^random should point to, so temp2—>next >random is the new copy, 
temp = head; 
while (temp != NULL) { 

temp2 = temp—^random; 

temp2—^random = temp2—>next—^random; 

temp = temp—>next; 

} 

//Step3: Repair damage to old list and fill in next pointer in new list. 

temp = head; 

while (temp != NULL) { 

temp2 = temp—^random; 
temp—►random = temp2—>next; 
temp2—>next = temp—>next—►random; 
temp = temp—>next; 

} 

} 

Time Complexity: 0(3n) ~0(n). Space Complexity: 0(1). 
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4.1 What is a Stack? 

A stack is a simple data structure used for storing data (similar to Linked Lists). In stack, the order in which the data 
arrives is important. The pile of plates of a cafeteria is a good example of stack. The plates are added to the stack as 
they are cleaned. They are placed on the top. When a plate is required it is taken from the top of the stack. The first 
plate placed on the stack is the last one to be used. 

Definition: A stack is an ordered list in which insertion and deletion are done at one end, where the end is called as 
top. The last element inserted is the first one to be deleted. Hence, it is called Last in First out (LIFO) or First in Last 
out (FILO) list. 

Special names are given to the two changes that can be made to a stack. When an element is inserted in a stack, the 
concept is called as push, and when an element is removed from the stack, the concept is called as pop. Trying to pop 
out an empty stack is called as underflow and trying to push an element in a full stack is called as overflow. 
Generally, we treat them as exceptions. As an example, consider the snapshots of the stack. 



4.2 How Stacks are used? 

Consider a working day in the office. Let us assume a developer is working on a long-term project. The manager then 
gives the developer a new task, which is more important. The developer places the long-term project aside and begins 
work on the new task. The phone then rings, this is the highest priority, as it must be answered immediately. The 
developer pushes the present task into the pending tray and answers the phone. W'hen the call is complete the task 
abandoned top answer the phone is retrieved from the pending tray and work progresses. If another call comes in, it 
may have to be handled in the same manner, but eventually the new task will be finished, and the developer can draw 
the long-term project from the pending tray and continue with that. 

4.3 Stack ADT 

The following operations make a stack an ADT. For simplicity assume the data is of integer type. 

Main stack operations 

• Push (int data): Inserts data onto stack. 
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• int Pop(): Removes and returns the last inserted element from the stack. 

Auxiliary stack operations 

• int Top(): Returns the last inserted element without removing it. 

• int Size(): Returns the number of elements stored in stack. 

• int IsEmptyStackQ: Indicates whether any elements are stored in stack or not. 

• int IsFullStackQ: Indicates whether the stack is full or not. 


Exceptions 

Attempting the execution of an operation may sometimes cause an error condition, called an exception. Exceptions are 
said to he “thrown” by an operation that cannot be executed. In the Stack ADT, operations pop and top cannot be 
performed if the stack is empty. Attempting the execution of pop (top) on an empty stack throws an exception. Trying 
to push an element in a full stack throws an exception. 


4.4 Applications 

Following are the some of the applications in which stacks plays an important role. 

Direct applications 

• Balancing of symbols 

• Infix-to-postfix conversion 

• Evaluation of postfix expression 

• Implementing function calls (including recursion) 

• Finding of spans (finding spans in stock markets, refer Problems section) 

• Page-visited history in a Web browser [Back Buttons] 

• Undo sequence in a text editor 

• Matching Tags in HTML and XML 

Indirect applications 

• Auxiliary data structure for other algorithms (Example: Tree traversal algorithms) 

• Component of other data structures (Example: Simulating queues, refer Queues chapter) 

4.5 Implementation 

There are many ways of implementing stack ADT and below are the commonly used methods. 

• Simple array based implementation 

• Dynamic array based implementation 

• Linked lists implementation 


Simple Array Implementation 


This implementation of stack ADT uses an array. In the array, we add elements from left to right and use a variable to 
keep track of the index of the top element. 
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The array storing the stack elements may become full. A push operation will then throw a full stack exception. 
Similarly, if we try deleting an element from empty stack then it will throw stack empty exception. 

struct ArrayStack { 
int top; 
int capacity; 
int ‘array; 

3 ; 

struct ArrayStack *CreateStack() { 

struct ArrayStack *S = malloc(sizeof(struct ArrayStack)); 
if(!S) return NULL; 

S—'capacity = 1; 

S—Hop = -1; 

S—>array= malloc(S—^capacity * sizeof(int)); 
if(!S—>-array) return NULL; 
return S; 

} 

int IsEmptyStack(struct ArrayStack *S) { 

return (S—>top == -1); // if the condition is true then 1 is returned else 0 is returned 

} 

int IsFullStack(struct ArrayStack *S){ 

//if the condition is true then 1 is returned else 0 is returned 
return (S—>top == S—^capacity - 1); 

3 

void Push(struct ArrayStack *S, int data)) 

/* S—'top == capacity -1 indicates that the stack is full*/ 
if(IsFullStack(S)) printf( “Stack Overflow”); 

else /‘Increasing the ‘top’ by 1 and storing the value at ‘top’ position*/ 

S—> array[++S—>top]= data; 

} 

int Pop(struct ArrayStack *S){ 

if(IsEmptyStack(S)){ /* S—>top == - 1 indicates empty stack*/ 

printf("Stack is Empty"); 
return 0; 

} 

else /* Removing element from ‘top’ of the array and reducing ‘top’ by 17 
return (S—► array[S—>top~]); 

} 

void DeleteStack(struct DynArrayStack *S){ 
if(S) { if(S—s-array) free(S—+array); 

free(S); 

1 

1 

Performance & Limitations 


Performance 


Let n be the number of elements in the stack. The complexities of stack operations with this representation can be 
given as: 

Space Complexity (for n push operations) O(n) 
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Time Complexity of Push() 

0(1) 

Time Complexity of Pop() 

0(1) 

Time Complexity of Size() 

0(1) 

Time Complexity of IsEmptyStack() 

0(1) 

Time Complexity of IsFullStack() 

0(1) 

Time Complexity of DeleteStackQ 

0(1) 


Limitations 

The maximum size of the stack must be defined in prior and cannot be changed. Trying to push a new element into a 
full stack causes an implementation-specific exception. 

Dynamic Array Implementation 

First, let’s consider how we implemented a simple array based stack. We took one index variable top which points to 
the index of the most recently inserted element in the stack. To insert (or push) an element, we increment top index 
and then place the new element at that index. Similarly, to delete (or pop) an element we take the element at top 
index and then decrement the top index. We represent empty queue with top value equal to -1. The issue still need 
to be resolved is that what we do when all the slots in fixed size array stack are occupied? 

First try: What if we increment the size of the array by 1 every time the stack is full? 

• PushQ.- increase size of S[] by 1 

• Pop(): decrease size of S[] by 1 

Problems with this approach? 

This way of incrementing the array size is too expensive. Let us a see the reason for this. For example, at n = 1, to 
push an element create a new array of size 2 and copy all the old array elements to new array and at the end add the 
new element. At n = 2, to push an element create a new array of size 3 and copy all the old array elements to new 
array and at the end add the new element. 

Similarly, at n = n — 1, if we want to push an element create a new array of size n and copy all the old array elements 
to new array and at the end add the new element. After n push operations the total time T(n ) (number of copy 
operations) is proportional to 1 + 2 + ... + n«0(n 2 ). 

Alternative Approach: Repeated Doubling 

Let us improve the complexity by using array doubling technique. If the array is full, create a new array of twice the 
size, and copy items. With this approach, pushing n items takes time proportional to n (not n 2 ). 

For simplicity, let us assume that initially we started with n = 1 and moved till n = 32. That means, we do the 

doubling at 1,2,4,8,16. The other way of analyzing the same is, at n = 1, if we want to add (push) an element then 

double the current size of array and copy all the elements of old array to new array. 

At, n = 1, we do 1 copy operation, at n = 2, we do 2 copy operations, and n = 4, we do 4 copy operations and so on. 
By the time we reach n = 32, the total number of copy operations is 1+2 + 4 + 84-16 = 31 which is 
approximately equal to 2n value (32). If we observe carefully, we are doing the doubling operation logn times. 

Now, let us generalize the discussion. For n push operations we double the array size logn times. That means, we will 
have logn terms in below expression. The total time T (n) of a series of n push operations is proportional to 

1 + 2 + 4 + 8... + T + tt + n ~ n + ^ + T + TJ■■■ + 4 + 2+ 1 

4 z z 4 o 

/ 111 4 2 1\ 

= n (1 + — + — + — ... -I-1-h — ) 

V 248 n n nJ 

= n(2 )« 2n = O(n) 

T(n) is O(n) and the amortized time of a push operation is 0(1) . 
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struct DynArrayStack { 
int top; 
int capacity; 
int ‘array; 

1 ; 

struct DynArrayStack *CreateStack(){ 

struct DynArrayStack *S = maIloc(sizeof(struct DynArrayStack)); 
if(!S) return NULL; 

S—^-capacity = 1; 

S—>-top = -1; 

S—>array = malloc(S—>-capacity * sizeof(int)); // allocate an array of size 1 initially 

if(!S—>array) return NULL; 
return S; 

3 

int IsFullStack(struct DynArrayStack *S){ 
return (S-»top == S->capacity-l); 

] 

void DoubleStack(struct DynArrayStack *S){ 

S—•■capacity ‘= 2; 

S—>array = realloc(S—>array, S—^capacity); 

3 

void Push(struct DynArrayStack *S, int x){ 

// No overflow in this implementation 
if(IsFullStack(S)) 

DoubleStack(S); 

S—>array[++S—>top] = x; 

} 

int IsEmptyStack(struct DynArrayStack *S){ 
return S—>top == -1; 

1 

int Top(struct DynArrayStack *S){ 
if(IsEmptyStack(S)) 

return INT_MIN; 
return S—»array[S—>top]; 

} 

int Popfstruct DynArrayStack *S){ 
if(IsEmptyStack(S)) 

return INT_MIN; 
return S—>array[S—»top—]; 

} 

void DeleteStack(struct DynArrayStack *S){ 
if(S) { 

if(S—>array) free(S-^-array); 
free(S); 

3 

3 

Performance 

Let n be the number of elements in the stack. The complexities for operations with this representation can be given as: 
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Space Complexity (for n push operations) [ O(n) 


Time Complexity of CreateStackQ 


Time Complexity of Push() 


Time Complexity of Pop() 


Time Complexity of Top() _ 


Time Complexity of IsEmptyStackQ 


Time Complexity of IsFullStackQ_ 


Time Complexity of DeleteStackQ_ 


Note: Too many doublings may cause memory overflow exception. 


Linked List Implementation 

The other way of implementing stacks is by using Linked lists. Push operation is implemented by inserting element at 
the beginning of the list. Pop operation is implemented by deleting the node from the beginning (the header/top 
node). 

NULL 


top 

struct ListNode{ 
int data; 

struct ListNode ’next; 

}; 

struct Stack *CreateStack(){ 
return NULL; 

} 

void Push(struct Stack “top, int data){ 
struct Stack ’temp; 
temp = malloc(sizeof(struct Stack)); 
if(!temp) return NULL; 
temp—>data = data; 
temp—>next = ’top; 

’top = temp; 

} 

int IsEmptyStack(struct Stack *top){ 
return top == NULL; 

} 

int Pop(struct Stack **top){ 
int data; 

struct Stack ’temp; 
if(IsEmptyStack(top)) 

return INT_MIN; 
temp = ’top; 

’top = ’top—>next; 
data = temp—>data; 
free(temp); 
return data; 
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int Top(struct Stack * topjf 

if(IsEmptyStack(top)) return INT_MIN; 
return top—>next—»data; 

} 

void DeleteStack(struct Stack **top){ 
struct Stack ’temp, *p; 
p = ’top; 

while( p—>next) { 

temp = p—>next; 
p—>next = temp—>next; 
free(temp); 

} 

free(p); 

} 

Performance 

Let n be the number of elements in the stack. Let n be the number of elements in the stack. The complexities for 
operations with this representation can be given as:_ 


Space Complexity (for n push operations) 

O(n) 

Time Complexity of CreateStackQ 

0(1) 

Time Complexity of PushQ 

0(1) (Average) 

Time Complexity of Pop() 

0(1) 

Time Complexity of Top() 

0(1) 

Time Complexity of IsEmptyStack() 

0(1) 

Time Complexity of DeleteStackQ 

O(n) 


4.6 Comparison of Implementations 

Comparing Incremental Strategy and Doubling Strategy 

We compare the incremental strategy and doubling strategy by analyzing the total time T(ri) needed to perform a 
series of n push operations. We start with an empty stack represented by an array of size 1. We call amortized time of 
a push operation is the average time taken by a push over the series of operations, i.e., T(ri)/n. 

Incremental Strategy: The amortized time (average time per operation) of a push operation is O(n) [0(n 2 )/ n]. 

Doubling Strategy: In this method, the amortized time of a push operation is 0(1) [O(n) / n]. 

Note: For reasoning, refer implementation section. 

Comparing Array Implementation and linked List Implementation 

Array Implementation 

• Operations take constant time, 

• Expensive doubling operation every once in a while. 

• Any sequence of n operations (starting from empty stack) — "amortized" bound takes time proportional to n. 
Linked list Implementation 

• Grows and shrinks gracefully. 

• Every operation takes constant time 0(1). 
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• Every operation uses extra space and time to deal with references. 


4.7 Problems on Stacks 


Problem-1 Discuss how stacks can be used for checking balancing of symbols? 

Solution: Stacks can be used to check whether the given expression has balanced symbols or not. This algorithm is 
very much useful in compilers. Each time parser reads one character at a time. If the character is an opening delimiter 
like (, {, or [- then it is written to the stack. When a closing delimiter is encountered like ),}, or ]- is encountered the 
stack is popped. The opening and closing delimiters are then compared. If they match, the parsing of the string 
continues. If they do not match, the parser indicates that there is an error on the line. A linear-time 0(n) algorithm 
based on stack can be given as: 

Algorithm 

a) Create a stack. 

b) while (end of input is not reached) { 

1) If the character read is not a symbol to be balanced, ignore it. 

2) If the character is an opening symbol like (, [, {, push it onto the stack 

3) If it is a closing symbol like ),],}, then if the stack is empty report an error. Otherwise pop the stack. 

4) If the symbol popped is not the corresponding opening symbol, report an error. 

} 

c) At end of input, if the stack is not empty report an error 

Examples:____ 


Example 

Valid? 

Description 

(A+B)+(C-D) 

Yes 

The expression is having balanced symbol 

((A+B)+(C-D) 

No 

One closing brace is missing 

((A+BMC-D]) 

Yes 

Opening and immediate closing braces correspond 

((A+BMC-D]} 

No 

The last closing brace does not correspond with the first opening parenthesis 


For tracing the algorithm let us assume that the input is: () (() [()]) 
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) 

Pop ( 

Test if( and A[i] match? YES 




Test if stack is Empty? YES 


TRUE 


Time Complexity: O(n). Since, we are scanning the input only once. Space Complexity: O(n) [for stack], 

Problem-2 Discuss infix to postfix conversion algorithm using stack? 

Solution: Before discussing the algorithm, first let us see the definitions of infix, prefix and postfix expressions. 

Infix: An infix expression is a single letter, or an operator, proceeded by one infix string and followed by another Infix 
string. 

A 

A+B 

(A+B)+ (C-D) 

Prefix: A prefix expression is a single letter, or an operator, followed by two prefix strings. Every prefix string longer 
than a single variable contains an operator, first operand and second operand. 

A 

+AB 

++AB-CD 

Postfix: A postfix expression (also called Reverse Polish Notation) is a single letter or an operator, preceded by two 
postfix strings. Every postfix string longer than a single variable contains first and second operands followed by an 
operator. 

A 

AB+ 

AB+CD—h 

Prefix and postfix notions are methods of writing mathematical expressions without parenthesis. Time to evaluate a 
postfix and prefix expression is O(n), were n is the number of elements in the array. 


Infix 

Prefix 

Postfix 

A+B 

+AB 

AB+ 

A+B-C 

-+ABC 

AB+C- 

(A+B)*C-D 

-*+ABCD 

AB+C*D- 


Now, let us concentrate on the algorithm. In infix expressions, the operator precedence is implicit unless we use 
parentheses. Therefore, for the infix to postfix conversion algorithm we have to define the operator precedence (or 
priority) inside the algorithm. The table shows the precedence and their associatively (order of evaluation) among 
operators. 


Token 

Operator 

Precedence 

Associatively 

0 

function call 

17 

left-to-right 

[] 

array element 



-+. 

struct or union member 



— ++ 

increment, decrement 

16 

left-to-right 

~ ++ 

decrement, increment 

15 

right-to-left 

! 

logical not 



- 

one’s complement 



“ + 

unary minus or plus 



&* 

address or indirection 



sizeof 

size (in bytes) 



(type) 

type cast 

14 

right-to-left 

*/% 

multiplicative 

13 

Left-to-right 
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+ - 

binary add or subtract 

12 

left-to-right 

« » 

shift 

11 

left-to-right 

> >= 

< <= 

relational 

10 

left-to-right 

= != 

equality 

9 

left-to-right 

& 

bitwise and 

8 

left-to-right 

A 

bitwise exclusive or 

7 

left-to-right 

| 

bitwise or 

6 

left-to-right 

&& 

logical and 

5 

left-to-right 

II 

logical or 

4 

left-to-right 

?: 

conditional 

3 

right-to-left 

= +=-=/=*=%= 

<<= »= 

&= A = 

assignment 

2 

right-to-left 

> 

Comma 

1 

left-to-right 


Important Properties 

• Let us consider the infix expression 2 + 3*4 and its postfix equivalent 234*+. Notice that between infix and 
postfix the order of the numbers (or operands) is unchanged. It is 2 3 4 in both cases. But the order of the 
operators * and + is affected in the two expressions. 

• Only one stack is enough to convert an infix expression to postfix expression. The stack that we use in the 
algorithm will be used to change the order of operators from infix to postfix. The stack we use will only 
contain operators and the open parentheses symbol “( • Postfix expressions do not contain parentheses. We 
shall not output the parentheses in the postfix output. 


Algorithm 

a) Create a stack 

b) for each character t in the input stream{ 

if(t is an operand) 
output t 

else if(t is a right parenthesis)! 

Pop and output tokens until a left parenthesis is popped (but not output) 

1 

else //1 is an operator or left parenthesis! 

pop and output tokens until one of lower priority than t is encountered or a left parenthesis is 
encountered or the stack is empty 
Push t 

1 

) 

c) pop and output tokens until the stack is empty 


Input Character 

Operation on Stack 

Stack 

Postfix Expression 

A 


Empty 

A 

* 

Push 

* 

A 

B 


* 

AB 

- 

Check and Push 

- 

AB* 

( 

Push 

-( 

AB* 

C 



AB*C 

+ 

Check and Push 

mm 

AB*C 
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D 



AB*CD 

) 

Pop and append to postfix till ‘(’ 

- 

AB*CD+ 

+ 

Check and Push 

+ 

AB*CD+- 

E 


+ 

AB*CD+-E 

End of input 

Pop till empty 


AB*CD+-E+ 


Problem-3 Discuss postfix evaluation using stacks? 

Solution: 

Algorithm 

1 Scan the Postfix string from left to right. 

2 Initialize an empty stack. 

3 Repeat the below steps 4 and 5 till all the characters are scanned. 

4 If the scanned character is an operand, push it onto the stack. 

5 If the scanned character is an operator, and if the operator is unary operator then pop an element from the 
stack. If the operator is binary operator then pop two elements from the stack. After popping the elements, 
apply the operator to those popped elements. Let the result of this operation be retVal onto the stack. 

6 After all characters are scanned, we will have only one element in the stack. 

7 Return top of the stack as result. 

Example: Let us see how the above algorithm works using an example. Assume that the postfix string is 123*+5-. 
Initially the stack is empty. Now, the first three characters scanned are 1, 2 and 3, which are operands. They will be 
pushed into the stack in that order. 



Stack 

Next character scanned is which is an operator. Thus, we pop the top two elements from the stack and perform the 
operation with the two operands. The second operand will be the first element that is popped. 



Stack 

The value of the expression (2*3) that has been evaluated (6) is pushed into the stack. 



Stack 
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Next character scanned is which is an operator. Thus, we pop the top two elements from the stack and perform the 
"+" operation with the two operands. The second operand will be the first element that is popped. 

1+6 = 7 

Expression 


Stack 

The value of the expression (1+6) that has been evaluated (7) is pushed into the stack. 


7 


Stack 


Expression 


Next character scanned is "5", which is added to the stack. 


5 


7 


Expression 


Stack 

Next character scanned is which is an operator. Thus, we pop the top two elements from the stack and perform the 
operation with the two operands. The second operand will be the first element that is popped. 


7-5 = 2 


Expression 


Stack 


The value of the expression(7-5) that has been evaluated(23) is pushed into the stack. 



Expression 


Stack 
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Now, since all the characters are scanned, the remaining element in the stack (there will be only one element in the 
stack) will be returned. End result: 

• Postfix String : 123*+5- 

• Result: 2 

Problem-4 Can we evaluate the infix expression with stacks in one pass? 

Solution: Using 2 stacks we can evaluate an infix expression in 1 pass without converting to postfix. 

Algorithm 

1) Create an empty operator stack 

2) Create an empty operand stack 

3) For each token in the input string 

a. Get the next token in the infix string 

b. If next token is an operand, place it on the operand stack 

c. If next token is an operator 

i. Evaluate the operator (next op) 

4) While operator stack is not empty, pop operator and operands (left and right), evaluate left operator right and 
push result onto operand stack 

5) Pop result from operator stack 

Problem-5 How to design a stack such that GetMinimum() should be 0(1)? 

Solution: Take an auxiliary stack which maintains the minimum of all values in the stack. Also, assume that, each 
element of the stack is less than its below elements. For simplicity let us call the auxiliary stack as min stack. 

When we pop the main stack, pop the min stack too. When we push the main stack, push either the new element or 
the current minimum, whichever is lower. At any point, if we want to get the minimum then we just need to return 
the top element from the min stack. Let us take some example and trace out. Initially let us assume that we have 
pushed 2, 6, 4,1 and 5. Based on above algorithm the min stack will look like: 


Main stack 

Min stack 

5 —► top 

1 —> top 

1 

1 

4 

2 

6 

2 

2 

2 


After popping twice we get: 


Main stack 

Min stack 

4 —> top 

2 —> top 

6 

2 

2 

2 


Based on the above discussion, now let us code the push, pop and GetMinimum() operations. 

struct AdvancedStackf 

struct Stack elementStack; 
struct Stack minStack; 

}; 

void Push(struct AdvancedStack *S, int data ){ 

Push (S—►elementStack, data); 

if(IsEmptyStack(S—»minStack) || Top(S—>minStack) >= data) 

Push (S—►minStack, data); 
else Push (S—»minStack, Top(S—>minStack)); 

) 

int Pop(struct AdvancedStack *S ){ 
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int temp; 

if(IsEmptyStack(S—>elementStack)) 
return-1; 

temp = Pop (S—>elementStack); 

Pop (S—*minStack); 
return temp; 

} 

int GetMinimum(struet AdvancedStack *S){ 
return Top(S—>minStack); 

] 

struct AdvancedStack *CreateAdvancedStack(){ 

struct AdvancedStack *S = (struct AdvancedStack *)malIoc(sizeof(struct AdvancedStack)); 

if(!S) 

return NULL; 

S—>elementStack = CreateS tack(); 

S-+minStack = CreateStackQ; 
return S; 

} 

Time complexity: 0(1). Space complexity: 0(n) [for Min stack]. This algorithm has much better space usage if we 
rarely get a "new minimum or equal' 1 . 

Problem-6 For the Problem-5 is it possible to improve the space complexity? 

Solution: Yes. The main problem of previous approach is, for each push operation we are pushing the element on to 
min stack also (either the new element or existing minimum element). That means, we are pushing the duplicate 
minimum elements on to the stack. 


Now, let us change the above algorithm to improve the space complexity. We still have the min stack, but we only pop 
from it when the value we pop from the main stack is equal to the one on the min stack. We only push to the min 
stack when the value being pushed onto the main stack is less than or equa to the current min value. In this modified 
algorithm also, if we want to get the minimum then we just need to return the top element from the min stack. For 


Main stack 

Min stack 

1 —*top 


5 

1 

4 

1 —> top 

6 

1 

2 

2 


Popping from the above pops 


from both stacks because 1 == 1, leaving: 


Main stack 

Min stack 

5 —►top 


1 

4 

6 

1 —> top 

2 

2 


Popping again only pops from the main stack, because 5 > 1: 



Main stack 

Min stack 



1 —>top 




4 




6 

1 —►top 
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Popping again pops both stacks because 1 == 1: 


Main stack 

Min stack 

4 —► top 


6 

2 

2 —»top 


Note: The difference is only in push & pop operations. 

struct AdvancedStack { 

struct Stack elementStack; 
struct Stack minStack; 
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void Push(struct AdvancedStack *S, int data){ 

Push (S—»elementStack, data); 

if(IsEmptyStack(S—> minStack) 11 Top(S—>minStack) >= data) 
Push (S—>minStack, data); 

} 

int Pop(struct AdvancedStack *S){ 
int temp; 

if(IsEmptyStack(S—>elementStack)) return -1; 

temp = Top (S—>elementStack); 

if(Top(S—> minStack) == Pop(S—>elementStack)) 

Pop (S—► minStack); 
return temp; 

] 

int GetMinimum(struct AdvancedStack *S){ 
return Top(S--> minStack); 

} 


Struct AdvancedStack * AdvancedStack(){ 

struct AdvancedStack *S = (struct AdvancedStack) malloc (sizeof (struct AdvancedStack)); 
if(!S) return NULL; 

S—>elementStack = CreateStackQ; 

S—^-minStack = CreateStack(); 
return S; 


] 


Time complexity: 0(1). Space complexity: 0(n) [for Min stack]. But this algorithm has much better space usage if we 
rarely get a "new minimum or equal". 


Problem-7 For a given array with n symbols how many stack permutations are possible? 

Solution: The number of stack permutations with n symbols is represented by Catalan number and we will discuss 
this in Dynamic Programming chapter. 

Problem-8 Given an array of characters formed with a’s and b’s. The string is marked with special character X 

which represents the middle of the list (for example: ababa...ababXbabab.baaa). Check whether the string is 

palindrome or not? 


Solution: This is one of the simplest algorithms. What we do is, start two indexes one at the beginning of the string 
and other at the ending of the string. Each time compare whether the values at both the indexes are same or not. If the 
values are not same then we say that the given string is a palindrome. If the values are same then increment the left 
index and decrement the right index. Continue this process until both the indexes meet at the middle (at X) or if the 
string is not palindrome. 


4.7 Problems on Stacks 


89 





Data Structures and Algorithms Made Easy 


Stacks 


int IsPalindrome(char *A){ 

int i=0, j = strlen(A)-l; 
while(i < j && A[i] == A[j]) { 
i++; 
j--; 

} 

if(i < j) { 

printf("Not a Palindrome"); 
return 0; 

1 

else { printf(“Palindrome"); 
return 1; 

} 

1 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-9 For the Problem-8, if the input is in singly linked list then how do we check whether the list 
elements form a palindrome or not? (That means, moving backward is not possible). 

Solution: Refer Linked Lists chapter. 

Problem-10 Can we solve Problem-8 using stacks? 

Solution: Yes. 

Algorithm 

• Traverse the list till we encounter X as input element. 

• During the traversal push all the elements (until X) on to the stack. 

• For the second half of the list, compare each elements content with top of the stack. If they are same then pop 
the stack and go to the next element in the input list. 

• If they are not same then the given string is not a palindrome. 

• Continue this process until the stack is empty or the string is not a palindrome. 

int IsPalindrome(char *A){ 
int i=0; 

struct Stack S= CreateStackQ; 
while(A[i] != X') { 

Push(S, A[i]); 
i++; 

} 

i++; 

while(A[i]) { 

if(IsEmptyStack(S) ||A[i] != Pop(S)) { 
printffNot a Palindrome"); 
return 0; 

] 

i++; 

} 

return IsEmptyStack(S); 

3 

Time Complexity: 0(n). Space Complexity: 0(n/2) »0(n). 

Problem-11 Given a stack, how to reverse the elements of stack by using only stack operations (push & pop)? 
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Solution: Algorithm 

• First pop all the elements of the stack till it becomes empty. 

• For each upward step in recursion, insert the element at the bottom of stack. 

void ReverseStack(struct Stack *S){ 
int data; 

if(IsEmptyStack(S)) return; 
data = Pop(S); 

ReverseStack(S); 

InsertAtBottom(S, data); 

} 

void InsertAtBottom(struct Stack *S, int data){ 
int temp; 

if(IsEmptyStack(S)) { 

Push(S, data); 
return; 

} 

temp = Pop(S); 

InsertAtBottom(S, data); 

Push(S, temp); 

3 

Time Complexity: 0(n 2 ). Space Complexity: 0(n), for recursive stack. 

Problem-12 Show how to implement one queue efficiently using two stacks. Analyze the running time of the 
queue operations. 

Solution: Refer Queues chapter. 

Problem- 13 Show how to implement one stack efficiently using two queues. Analyze the running time of the 
stack operations. 

Solution: Refer Queues chapter. 

Problem-14 How do we implement 2 stacks using only one array? Our stack routines should not indicate an 
exception unless every slot in the array is used? 

Solution: 


Algorithm: 

• Start two indexes one at the left end and other at the right end. 

• The left index simulates the first stack and the right index simulates the second stack. 

• If we want to push an element into the first stack then put the element at left index. 

• Similarly, if we want to push an element into the second stack then put the element at right index. 

• First stack gets grows towards right, second stack grows towards left. 

Time Complexity of push and pop for both stacks is 0(1). Space Complexity is 0(1). 

Problem-15 3 stacks in one array: How to implement 3 stacks in one array? 

Solution: For this problem, there could be other way of solving it. Below is one such possibility and it works as long as 
there is an empty space in the array. 
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Stack-1 I Stack-3 

Topi Top3 Top2 

To implement 3 stacks we keep the following information. 

• The index of the first stack (Topi): this indicates the size of the first stack. 

• The index of the second stack (Top2): this indicates the size of the second stack. 

• Starting index of the third stack (base address of third stack). 

• Top index of the third stack. 

Now, let us define the push and pop operations for this implementation. 


f 




Stack-2 


Pushing: 

• For pushing on to the first stack, we need to see if adding a new element causes it to bump into the third 
stack. If so, try to shift the third stack upwards. Insert the new element at (startl + Topi). 

• For pushing to the second stack, we need to see if adding a new element causes it to bump into the third stack. 
If so, try to shift the third stack downward. Insert the new element at (start2 - Top2). 

• When pushing to the third stack, see if it bumps the second stack. If so, try to shift the third stack downward 
and try pushing again. Insert the new element at (start3 + Top3). 

Time Complexity: O(n). Since, we may need to adjust the third stack. Space Complexity: 0(1). 

Popping: For popping, we don’t need to shift, just decrement the size of the appropriate stack. 

Time Complexity: 0(1). Space Complexity: 0(1). 

Problem-16 For Problem-15, is there any other way implementing middle stack? 

Solution: Yes. When either the left stack (which grows to the right) or the right stack (which grows to the left) bumps 
into the middle stack, we need to shift the entire middle stack to make room. The same thing happens if a push on the 
middle stack causes it to hump into the right stack. To solve the above problem (number of shifts) what we can do is, 
alternating pushes could be added at alternating sides of the middle list (For example, even elements are pushed to the 
left, odd elements are pushed to the right).This would keep the middle stack balanced in the center of the array but it 
would still need to be shifted when it bumps into the left or right stack, whether by growing on its own or by the 
growth of a neighboring stack. 

We can optimize the initial locations of the three stacks if they grow/shrink at different rates and if they have 
different average sizes. For example, suppose one stack doesn't change much. If you put it at the left then the middle 
stack will eventually get pushed against it and leave a gap between the middle and right stacks, which grow toward 
each other. If they collide, then it's likely you've run out of space in the array. There is no change in the time 
complexity but the average number of shifts will get reduced. 

Problem-17 Multiple ( m ) stacks in one array: As similar to Problem-15, what if we want to implement m stacks 
in one array? 

Solution: Let us assume that array indexes are from 1 to n. As similar to the discussion of Problem-15, to implement m 
stacks in one array, we divide the array into m parts (as shown below). The size of each part is —. 

n 2n 


T 

Base[m+1] 

Top[m+l] 


T 

Base[3] 

Top[3] 


T 

Base[l] 


J 

Base[2] 


Top[l] 


Top[2] 
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From the above representation we can see that, first stack is starting at index 1 (starting index is stored in Base[l]), 
second stack is starting at index ^ (starting index is stored in Base[2]), third stack is starting at index ^ (starting index 

is stored in Base[3]) and so on. Similar to Base array, let us assume that Top array stores the top indexes for each of the 
stack. Consider the following terminology for the discussion. 

• Top[i], for 1 < i < m will point to the topmost element of the stack i. 

• If Base[i] == Top[i], then we can say the stack i is empty. 

• If Top[i] == Base[i+1], then we can say the stack i is full. 

Initially Base[i] = Topfi] = ^ (i - 1), for 1 < i < m. 

• The i th stack grows from Base[i]+1 to Base[i+1], 

Pushing on to i th stack: 

1) For pushing on to the i th stack, we check whether top of i th stack is pointing to Base[i+1] (this case defines 
that i th stack is full). That means, we need to see if adding a new element causes it to bump into the i + 1 th 
stack. If so, try to shift the stacks from i + 1 th stack to m th stack towards right. Insert the new element at 
(Base[i] + Top[i]). 

2) If right shifting is not possible then try shifting the stacks from 1 to i — 1 th stack towards left. 

3) If both of them are not possible then we can say that all stacks are full. 

void Push(int StackID, int data) { 
if(Top[i] == Base[i+1]) 

Print i th Stack is full and does the necessary action (shifting); 

Top[i] = Top[i]+l; 

A[Top[i]] = data; 

} 

Time Complexity: O(n). Since, we may need to adjust the stacks. Space Complexity: 0(1). 

Popping from I th stack: For popping, we don’t need to shift, just decrement the size of the appropriate stack. The only 
case to check is stack empty case. 

int Pop(int StackID) { 

if(Top[i] == Base[i]) 

Print i th Stack is empty; 
return A[Top[i]—]; 

J 

Time Complexity: 0(1). Space Complexity: 0(1). 

Problem-18 Consider an empty stack of integers. Let the numbers 1, 2,3,4, 5, 6 be pushed on to this stack only in 
the order they appeared from left to right. Let S indicates a push and X indicates a pop operation. Can they be 
permuted in to the order 325641(output) and order 154623? (If a permutation is possible give the order string of 
operations. 

Solution: SSSXXSSXSXXX outputs 325641. 154623 cannot be output as 2 is pushed much before 3 so can appear only 
after 3 is output. 


Problem-19 Earlier of this chapter, we have seen that, for dynamic array implementation of stack, we have used 
repeated doubling approach. For the same problem what is the complexity if we create a new array whose size is 
n + K instead of doubling? 


Solution: Let us assume that the initial stack size is 0. For simplicity let us assume that K = 10. For inserting the 
element we create a new array whose size is 0 + 10 = 10. Similarly, after 10 elements we again create a new array 
whose size is 10 + 10 = 20 and this process continues at values: 30,40 ... That means, for a given n value, we are 
creating the new arrays at: The total number of copy operations are: 


—-1-f- - -J- • ■ 

10 20 30 


,1 = w6 + I + J + -9 = ^ l09n * °( n ^n) 
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If we are performing n push operations, the cost of per operation is O(logn). 

Problem-20 Given a string containing n S's and n X's where S indicates a push operation and X indicates a pop 
operation, and with the stack initially empty, Formulate a rule to check whether a given string 5 of operations is 
admissible or not? 

Solution: Given a string of length 2n, we wish to check whether the given string of operations is permissible or not 
with respect to its functioning on a stack. The only restricted operation is pop whose prior requirement is that the 
stack should not be empty. So while traversing the string from left to right, prior to any pop the stack shouldn't be 
empty which means the no of S's is always greater than or equal to that of X's. Hence the condition is at any stage on 
processing of the string, number of push operations (S) should be greater than number of pop operations (X). 

Problem-21 Suppose there are two singly linked lists both of which intersect at some point and become a single 
linked list. The head or start pointers of both the lists are known, but the intersecting node is not known. Also, the 
number of nodes in each of the list before they intersect are unknown and both list may have it different. Listl 
may have n nodes before it reaches intersection point and List2 might have m nodes before it reaches intersection 
point where m and n may be m = n,m < norm > n. Can we find the merging point using stacks? 



Solution: Yes. For algorithm refer Linked Lists chapter. 

Problem-22 Finding Spans: Given an array A the span 5[i] of ^4[i] is the maximum number of consecutive 
elements A[j] immediately preceding 4 [i] and such that A\j] <A[i]7 

Solution: I 
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This is a very common problem in stock markets to find the peaks. Spans have applications to financial analysis (E.g., 
stock at 52-week high). The span of a stocks price on a certain day, i, is the maximum number of consecutive days (up 
to the current day) the price of the stock has been less than or equal to its price on i. As an example, let us consider the 
following table and the corresponding spans diagram. In the figure the arrows indicates the length of the spans. Now, 
let us concentrate on the algorithm for finding the spans. One simple way is, each day, check how many contiguous 
days are with less stock price than current price. 


Algorithm: FindingSpans(int A[],int n) { 

//Input: array A of n integers, Output: array S of spans of A 
int i, j, S[n]; //new array of n integers; 
for (i = 0; i < n; i++) { 

j = i; 

while j <= i && A[i] > A[i-j] 

j = j + i; 

S[i] = j; 

1 

return S; 

1 

Time Complexity: 0(n 2 ). Space Complexity: 0(1). 


//Executes n times 
n 

1 + 2 + ...+ (n - 1) 
1 + 2 + ...+ (n - 1) 
n 

1 


Problem-23 Can we improve the complexity of Problem-22? 

Solution: From the above example, we can see that the span 5[i] on day i can be easily calculated if we know the 
closest day preceding i, such that the price is greater than on that day than the price on day i. Let us call such a day as 
P. If such a day exists then the span is now defined as S[i] = i — P. 

Algorithm: FindingSpans(int A[], int n) { 

struct stack *D = CreateStackQ; 
int P; 

for (int i = 0 i< n; i++) { 

while (IIsEmptyStack(D)) { 
if(A[i] > A[Top(D)]) 

Pop(D); 

} 

if(IsEmptyStack(D)) 

P = -1; 

else P = Top(D); 

S[i] = i-P; 

Push(D, i); 

} 

return S; 

} 

Time Complexity: Each index of the array is pushed into the stack exactly one and also popped from the stack at most 
once. The statements in the while loop are executed at most n times. Even though the algorithm has nested loops, the 
complexity is O(n) as the inner loop is executing only n times during the course of algorithm (trace out an example 
and see how many times the inner loop is becoming success). Space Complexity: 0(n) [for stack], 

Problem-24 Largest rectangle under histogram: A histogram is a polygon composed of a sequence of rectangles 
aligned at a common base line. For simplicity, assume that the rectangles are having equal widths hut may have 
different heights. For example, the figure on the left shows the histogram that consists of rectangles with the 
heights 3,2,5,6,1,4,4, measured in units where 1 is the width of the rectangles. Here our problem is: given an 
array with heights of rectangles (assuming width is 1), we need to find the largest rectabgle possible. For the given 
example the largest rectangle is the shared part. 
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Solution: A straightforward answer is to go for each bar in the histogram and find the maximum possible area in 
histogram for it. Finally, find the maximum of these values. This will require 0(n z ). 

Problem-25 For Problem-24, can we improve the time complexity? 

Solution: Linear search using a stack of incomplete subproblems: There are many ways of solving this problem. Judge 
has given a nice algorithm for this problem which is based on stack. Process the elements in left-to-right order and 
maintain a stack of information about started but yet unfinished sub histograms. 

If the stack is empty, open a new subproblem by pushing the element onto the stack. Otherwise compare it to the 
element on top of the stack. If the new one is greater we again push it. If the new one is equal we skip it. In all these 
cases, we continue with the next new element. If the new one is less, we finish the topmost subproblem by updating 
the maximum area with respect to the element at the top of the stack. Then, we discard the element at the top, and 
repeat the procedure keeping the current new element. This way, all subproblems are finished until the stack becomes 
empty, or its top element is less than or equal to the new element, leading to the actions described above. If all 
elements have been processed, and the stack is not yet empty, we finish the remaining subproblems by updating the 
maximum area with respect to to the elements at the top. 

struct Stackltem { 
int height; 
int index; 

}; 

int MaxRectangleArea(int A[], int n) { 

int i, maxArea=-l, top = -1, left, currentArea; 

struct Stackltem *S = (struct Stackltem *) malloc(sizeof(struct Stackltem) * n); 
for(i=0; i<=n; i++) { 

while(top >= 0 && (i==n || S[top]—>data > A[i])) { 
if(top > 0) 

left = S[top-l]—nndex; 
else left = -l; 

currentArea = (i - left-1) * S[top]—>data; 

—top; 

if(currentArea > maxArea) 

maxArea = currentArea; 

) 

if(i<n) { ++top; 

S[top]—>data = A[i]; 

S[top]—tindex = i; 

} 

} 

return maxArea; 

} 

In first impression, this solution seems to be having 0(n 2 ) complexity. But if we look carefully, every element is 
pushed and popped at most once and in every step of the function at least one element is pushed or popped. Since the 
amount of work for the decisions and the update is constant, the complexity of the algorithm is O(n) by amortized 
analysis. Space Complexity: O(n) [for stack]. 
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& 


5.1 What is a Queue? 

A queue is a data structure used for storing data (similar to Linked Lists and Stacks). In queue, the order in which the 
data arrives is important. In general, a queue is a line of people or things waiting to be served in sequential order 
starting at the beginning of the line or sequence. 

Definition: A queue is an ordered list in which insertions are done at one end {rear) and deletions are done at other 
end (front). The first element to be inserted is the first one to be deleted. Hence, it is called as First in First out (FIFO) 
or Last in Last out (LILO) list. 

Similar to Stacks, special names are given to the two changes that can be made to a queue. When an element is 
inserted in a queue, the concept is called as EnQueue, and when an element is removed from the queue, the concept is 
called as DeQueue. Trying to DeQueue an empty queue is called as underflow and trying to EnQueue an element in 
a full queue is called as overflow. Generally, we treat them as exceptions. As an example, consider the snapshot of the 
queue. 



(DeQueue) (EnQueue) 

5.2 How is Queues Used? 

Line at reservation counter explains the concept of a queue. When we enter the line we put ourselves at the end of the 
line and the person who is at the front of the fine is the next who will be served. The person will exit the queue and 
will be served. 

In the meanwhile the queue is served and next person at head of the line will exit the queue and will be served. While 
the queue is served, we will move towards the head of the line since each person that is served will be removed from 
the head of the queue. Finally we will reach head of the line and we will exit the queue and be served. This behavior is 
very useful in any cases where there is need to maintain the order of arrival. 

5.3 Queue ADT 

The following operations make a queue an ADT. Insertions and deletions in queue must follow the FIFO scheme. For 
simplicity we assume the elements are integers. 

Main Queue Operations 

• EnQueue(int data): Inserts an element at the end of the queue 

• int DeQueueQ: Removes and returns the element at the front of the queue 

Auxiliary Queue Operations 

• int FrontQ: Returns the element at the front without removing it 
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• int QueueSize(): Returns the number of elements stored 

• int IsEmptyQueue(): Indicates whether no elements are stored 

5.4 Exceptions 

As similar to other ADTs attempting execution of DeQueue on an empty queue throws an “Empty Queue Exception “ 
and attempting execution of EnQueue on an full queue throws an “Full Queue Exception “. 

5.5 Applications 

Following are the some of the applications in which queues are being used. 

Direct Applications 

• Operating systems schedule jobs (with equal priority) in the order of arrival (e.g., a print queue). 

• Simulation of real-world queues such as lines at a ticket counter or any other first-come first-served scenario 
requires a queue. 

• Multiprogramming. 

• Asynchronous data transfer (file IO, pipes, sockets). 

• Waiting times of customers at call center. 

• Determining number of cashiers to have at a supermarket. 

Indirect Applications 

• Auxiliary data structure for algorithms 

• Component of other data structures 

5.6 Implementation 

There are many ways (similar to Stacks) of implementing queue operations and below are commonly used methods. 

• Simple circular array based implementation 

• Dynamic circular array based implementation 

• Linked lists implementation 

Why Circular Arrays? 

First, let us see whether we can use simple arrays for implementing queues which we have done for stacks. We know 
that, in queues, the insertions are performed at one end and deletions are performed at other end. After some 
insertions and deletions it is easy to get the situation as shown below. It can be seen clearly that, the initial slots of the 
array are getting wasted. So, simple array implementation for queue is not efficient. To solve this problem we assume 
the arrays as circular arrays. That means, we treat last element and first array elements are contiguous. With this 
representation, if there are any free slots at the beginning, the rear pointer can easily go to its next free slot. 


New elements ready 
to enter Queue 
(enQueue) 

Note: The simple circular array and dynamic circular array implementations are very much similar to stack array 
implementations. Refer Stacks chapter for analysis of these implementations. 


T 

rear 



r 

front 


5.4 Exceptions 


98 




Data Structures and Algorithms Made Easy 


Queues 


Simple Circular Array Implementation 



This simple implementation of Queue ADT uses an array. In the array, we add elements circularly and use two 
variables to keep track of start element and end element. Generally, front is used to indicate the start element and 
rear is used to indicate the end element in the queue. The array storing the queue elements may become full. An 
EnQueue operation will then throw a full queue exception. Similarly, if we try deleting an element from empty 
queue then it will throw empty queue exception. 

Note: Initially, both front and rear points to -1 which indicates that the queue is empty. 

struct ArrayQueue { 

int front, rear; 
int capacity; 
int ‘array; 

}; 

struct ArrayQueue *Queue(int size) { 

struct ArrayQueue *Q_= malloc(sizeof(struct ArrayQueue)); 
if(!Q) return NULL; 

Q_—^-capacity = size; 

Q.—>front = Q_—>rear = -1; 

Q.—>array= malloc(Q_—^capacity * sizeof(int)); 
if(!Q_—»-array) 

return NULL; 
return Q; 

} 

int IsEmptyQueue(struct ArrayQueue *Q) { 

//if the condition is true then 1 is returned else 0 is returned 
return (Q;->front == -1); 

} 

int IsFullQueue(struct ArrayQueue *Q) { 

//if the condition is true then 1 is returned else 0 is returned 
return ((Q.->rear +1) % Q_—>capacity == Q_—>front); 

} 

int QueueSize() { 

return (Q.—^-capacity - Q_—>front + Q_—>rear + 1)% Q_—^capacity; 

} 

void EnQueue(struct ArrayQueue *Q, int data) { 
if(IsFullQueue(Q)) 

printf(“Queue Overflow”); 
else { Q_—>rear = (Q_—>rear+l) % Q,—^capacity; 

Qr> array[Q->rear]= data; 
if(Q—>front == -1) 
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Q.—►front = Q,—►rear; 

} 

} 

int DeQueue(struct ArrayQueue *Q) { 

int data = 0;//or element which does not exist in Queue 
if(IsEmptyQueue(Q)) { 

printf("Queue is Empty"); 
return 0; 

} 

else { data = Q,—►array[Q.—►front]; 
if(Q.—►front == Q.—►rear) 

Q.-»front = Q.—►rear = -1; 
else Q_—> front = (Q.—►front+1) % Q.—^capacity; 

1 

return data; 

3 

void DeleteQueue (struct ArrayQueue *Q) { 

if(Q) { 

if(Q_—►array) 

free(Q.—►array); 

free(Q); 

} 

} 

Performance & Limitations 

Performance: Let n be the number of elements in the queue:_ 


Space Complexity (for n EnQueue operations) 

0(n) 

Time Complexity of EnQueueQ 

0(1) 

Time Complexity of DeQueue() 

0(1) 

Time Complexity of IsEmptyQueue() 

0(1) 

Time Complexity of IsFullQueue() 

0(1) 

Time Complexity of QueueSize() 

0(1) 

Time Complexity of DeleteQueue() 

0(1) 


Limitations: The maximum size of the queue must be defined a prior and cannot be changed. Trying to EnQueue a 
new element into a full queue causes an implementation-specific exception. 

Dynamic Circular Array Implementation 

struct DynArrayQueue { 
int front, rear; 
int capacity; 
int ‘array; 

1 ; 

struct DynArrayQueue *CreateDynQueue() { 

struct DynArrayQueue *Q.= malloc(sizeof(struct DynArrayQueue)); 
if(!Q) return NULL; 

Q,—►capacity = 1; 

Q.—►front = Q.—►rear = -1; 

Q—>array= malloc(Q.—^capacity * sizeof(int)); 

if(!Q_—►array)______ 


5.6 Implementation 


100 



Data Structures and Algorithms Made Easy 


Queues 


return NULL; 
return Q; 

3 

int IsEmpty Queue (struct DynArrayQueue *Q) { 

// if the condition is true then 1 is returned else 0 is returned 
return (Q—(front == -1); 

3 

int IsFullQueue(struct DynArrayQueue *Q) { 

//if the condition is true then 1 is returned else 0 is returned 
return ((Q,—(rear +1) % Q_—>capacity == Q_—(front); 

3 

int QueueSizeQ { 

return (Q.—(capacity - Q_—(front + Q.->rear + 1)% Q_—►capacity; 

} 

void EnQueue(struct DynArrayQueue *Q, int data) { 
if(IsFullQueue(Q)) 

ResizeQueue(Q); 

Q_—(rear = (Q_-^rear+l)% Q_—(capacity; 

Q.—» array[Q r ->rear]= data; 
if(Q.—(front == -1) 

Q.—>front = Q_—>rear; 

] 

void ResizeQueue(struct DynArrayQueue *Q) { 
int size = Q_—(capacity; 

Q,—(capacity = Q_—(capacity*2; 

Q.—(-array = realloc (Q.—(-array, Q.—(-capacity); 
if(!Q.—(-array) { 

printf(“Memory Error”); 
return; 

} 

if(Q.—(front > Q_—>rear) { 

for(int i=0; i < Q.-^front; i++) { 

Q.—(array [i+size] =Q_—(array [i]; 

} 

Q_—(rear = Q_—^rear + size; 

} 

3 

int DeQueue(struct DynArrayQueue *Q) { 

int data = 0;//or element which does not exist in Queue 
if(IsEmptyQueue(Q)) { 

printf("Queue is Empty"); 
return 0; 

} 

else { data = Q.—(array[Q.—(front]; 
if(Q_—(front== Q^rear) 

Q.—(front= Q_—(rear = -1; 

else 

Q.—(front = (Q_—(front+1) % Q_—(capacity; 

} 

return data; 
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1 

void DeleteQueue(struct DynArrayQueue *Q) { 

if(Q){ 

if(Q.—> array) 

free(Q.—>array); 
free(Q_—>array); 

} 

} 

Performance 

Let n be the number of elements in the queue. 


Space Complexity (for n EnQueue operations) 

O(n) 

Time Complexity of EnQueue() 

0(1) (Average) 

Time Complexity of DeQueue() 

0(1) 

Time Complexity of QueueSize() 

0(1) 

Time Complexity of IsEmptyQueue() 

0(1) 

Time Complexity of IsFullQueue() 

0(1) 

Time Complexity of QueueSize() 

0(1) 

Time Complexity of DeleteQueueQ 

0(1) 


Linked List Implementation 


The other way of implementing queues is by using Linked lists. EnQueue operation is implemented by inserting 
element at the ending of the list. DeQueue operation is implemented by deleting an element from the beginning of the 



front 

struct ListNode { 
int data; 

struct ListNode ‘next; 

); 

struct Queue *CreateQueue() { 
struct Queue *Q; 
struct ListNode ‘temp; 

Q_= malloc(sizeof(struct Queue)); 
if(!Q) return NULL; 
temp = malloc(sizeof(struct ListNode)); 
Q—►front = Q_—>rear = NULL; 
return Q; 


struct Queue { 

struct ListNode ‘front; 
struct ListNode ‘rear; 

1 ; 


} 

int IsEmptyQueue(struct Queue *Q) { 

// if the condition is true then 1 is returned else 0 is returned 


return (Q.—vfront == NULL); 


} 

void EnQueue(struct Queue *Q, int data) { 
struct ListNode ‘newNode; 
newNode = malloc(sizeof(struct ListNode)); 


rear 


5.6 Implementation 


102 






Data Structures and Algorithms Made Easy 


Queues 


if(!newNode) 

return NULL; 
newNode—>data = data; 
newNode—>next = NULL; 

Q.—>rear—»next = newNode; 

Q.—>rear = newNode; 
if(Q.—>front == NULL) 

Qr^►front = Q_—>rear; 

] 

int DeQueue(struct Queue *Q) { 

int data = 0; //or element which does not exist in Queue 
struct ListNode *temp; 
if(IsEmptyQueue(Q)) { 

printff'Queue is empty"); 
return 0; 

} 

else { temp = Q_—vfront; 

data = Q—>ffont—>data; 

Q,—>front== Q-^front—>next; 
free(temp); 

} 

return data; 

1 

void DeleteQueue(struct Queue *Q) { 
struct ListNode *temp; 
while (Q) { 

temp = Q; 

Q.= Q.—>next; 
free(temp); 

1 

free(Q); 

] 

Performance 

Let n be the number of elements in the queue, then_ 


Space Complexity (for n EnQueue operations) 

O(n) 

Time Complexity of EnQueue() 

0(1) (Average) 

Time Complexity of DeQueue() 

0(1) 

Time Complexity of IsEmptyQueue() 

0(1) 

Time Complexity of DeleteQueue() 

0(1) 


Comparison of Implementations 

Note: Comparison is very much similar to stack implementations and Stacks chapter. 

5.7 Problems on Queues 

Problem-1 Give an algorithm for reversing a queue Q. To access the queue, we are only allowed to use the 
methods of queue ADT. 
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Solution: 

void ReverseQueue(struct Queue *Q) { 
struct Stack *S = CreateStack(); 
while (!IsEmptyQueue(Q)) 

Push(S, DeQueue(Q)) 
while (!IsEmptyStack(S)) 

EnQueue(Q, Pop(S)); 

} 

Time Complexity: O(n). 

Problem-2 How to implement a queue using two stacks? 

Solution: Let SI and S2 be the two stacks to be used in the implementation of queue. All we have to do is to define the 
EnQueue and DeQueue operations for the queue, 
struct Queue { 

struct Stack *S1; // for EnQueue 
struct Stack *S2; // for DeQueue 

} 

EnQueue Algorithm 

• Just push on to stack SI 

void EnQueue(struct Queue *Q, int data) { 

Push(Q.—>S1, data); 

} 

Time Complexity: 0(1). 

DeQueue Algorithm 

• If stack S2 is not empty then pop from S2 and return that element. 

• If stack is empty, then transfer all elements from SI to S2 and pop the top element from S2 and return that 
popped element [we can optimize the code little by transferring only n — 1 elements from SI to S2 and pop 
the n th element from SI and return that popped element], 

• If stack SI is also empty then throw error. 

int DeQueue(struct Queue *Q) { 
if(!IsEmptyStack(Q.—>S2)) 

return Pop(Q_—>S2); 
else { while(!IsEmptyStack(Q_—>S1)) 

Push(Q_—*-S2, Pop(Q.-+Sl)); 
return Pop(Q_—>S2); 

} 

1 

Time Complexity: From the algorithm, if the stack S2 is not empty then the complexity is 0(1). If the stack S2 is 
empty then, we need to transfer the elements from SI to S2. But if we carefully observe, the number of transferred 
elements and the number of popped elements from S2 are equal. Due to this the average complexity of pop operation 
in this case is 0(1). Amortized complexity of pop operation is 0(1). 

Problem-3 Show how to efficiently implement one stack using two queues. Analyze the running time of the 
stack operations. 

Solution: Let Q1 and Q2 be the two queues to be used in the implementation of stack. All we have to do is to define 
the push and pop operations for the stack, 
struct Stack { 

struct Queue *Q1; _ 
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struct Queue *Q2; 

} 

In below algorithms, we make sure that one queue is empty always. 

Push Operation Algorithm: Whichever is queue is not empty, push the element into it. 

• Check whether queue Ql is empty or not. If Ql is empty then Enqueue the element into Q2. 

• Otherwise EnQueue the element into Ql. 

Push(struct Stack *S, int data) { 

if(IsEmptyQueue(S—>Q1)) 

EnQueue(S—»Q2, data); 
else EnQueue(S—»Q1, data); 

1 

Time Complexity: 0(1). 

Pop Operation Algorithm: Transfer n — 1 elements to other queue and delete last from queue for performing pop 
operation. 

• If queue Ql is not empty then transfer n — 1 elements from Ql to Q2 and then, DeQueue the last element of 
Ql and return it. 

• If queue Q2 is not empty then transfer n — 1 elements from Q2 to Ql and then, DeQueue the last element of 
Q2 and return it. 

int Pop(struct Stack *S) { 
int i, size; 

if(IsEmptyQueue(S—>Q2)) { 
size = size(S—>Q1); 
i = 0; 

while(i < size-1) { 

EnQueue(S—>Q2, DeQueue(S—>Q1)); 
i++; 

3 

return DeQueue(S—>Q1); 

} 

else { size = size(S—»Q2); 

while(i < size-1) { 

EnQueue(S—>Q1, DeQueue(S—>Q2)); 
i++; 

} 

return DeQueue(S—>Q2); 

) 

} 

Time Complexity: Running time of pop operation is 0(n) as each time pop is called, we are transferring all the 
elements from one queue to other. 

Problem-4 Maximum sum in sliding window: Given array A[] with sliding window of size w which is moving 
from the very left of the array to the very right. Assume that we can only see the w numbers in the window. Each 
time the sliding window moves righ twards by one position. For example: The array is [1 3 -1 -3 5 3 6 7], and w is 3. 


Window position 

Max 

[1 3 -1]-3 5 3 6 7 

3 

1 [3 -1 -3] 5 3 6 7 

3 

1 3 [-1 -3 5] 3 6 7 

5 

1 3 -1 [-3 5 3] 6 7 

5 
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Input: A long array A[], and a window width w. Output: An array B[], B[i] is the maximum value of from A[i] to 
A[i+w-l]. Requirement: Find a good optimal way to get B[i] 

Solution: This problem can be solved with doubly ended queue (which support insertion and deletions at both ends). 
Refer Priority Queues chapter for algorithms. 

Problem-5 Given a queue Q, containing n elements, transfer these items on to a stack S (initially empty) so that 
front element of Q_ appears at the top of the stack and the order of all other items is preserved. Using enqueue and 
dequeue operations for the queue and push and pop operations for the stack, outline an efficient O(n) algorithm to 
accomplish the above task, using only a constant amount of additional storage. 

Solution: Assume the elements of queue Qare a lt a 2 ... a n . Dequeuing all elements and pushing them onto the stack 
will result in a stack with an at the top and a 1 at the bottom. This is done in O(n) time as dequeue and push each 
require constant time per operation. The queue is now empty. By popping all elements and pushing them on the the 
queue we will get % at the top of the stack. This is done again in O (n) time. As in big-oh arithmetic we can ignore 
constant factors, the process is carried out in 0(n) time. The amount of additional storage needed here has to be big 
enough to temporarily hold one item. 

Problem-6 A queue is set up in a circular array A[0..n - 1] with front and rear defined as usual. Assume that 
n — 1 locations in the array are available for storing the elements (with the other element being used to detect 
full/empty condition). Give a formula for the number of elements in the queue in terms of rear, front, and n. 



• Rear of the queue is somewhere clockwise from the front 

• To enqueue an element, we move rear one position clockwise and write the element in that position 

• To dequeue, we simply move front one position clockwise 

• Queue migrates in a clockwise direction as we enqueue and dequeue 

• Emptiness and fullness to be checked carefully. 

• Analyze the possible situations (make some drawings to see where front and rear are when the queue is 

empty, and partially and totally filled). We will get this: 

(rear - front + 1 if rear == front 
Number Of Elements = ( rear _ /nmt + n otherwise 
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Chapter-6 

Trees 

^ 


6.1 What is a Tree? 

A tree is a data structure similar to a linked list but instead of each node pointing simply to the next node in a linear 
fashion, each node points to a number of nodes. Tree is an example of non-linear data structures. A tree structure is a 
way of representing the hierarchical nature of a structure in a graphical form. 

In trees ADT (Abstract Data Type), order of the elements is not important. If we need ordering information linear data 
structures like linked lists, stacks, queues, etc. can be used. 

6.2 Glossary 



• The root of a tree is the node with no parents. There can he at most one root node in a tree (node A in the above 
example). 

• An edge refers to the link from parent to child (all links in the figure). 

• A node with no children is called leaf node ( E,J, K, H and I). 

• Children of same parent are called siblings ( B, C, D are siblings of A and E, F are the siblings of B). 

• A node p is an ancestor of a node q if there exists a path from root to q and p appears on the path. The node q is 
called a descendant of p. For example, A, C and G are the ancestors for K. 

• Set of all nodes at a given depth is called level of the tree ( B, C and D are same level). The root node is at level 



• The depth of a node is the length of the path from the root to the node (depth of G is 2, A — C — G). 
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• The height of a node is the length of the path from that node to the deepest node. The height of a tree is the 
length of the path from the root to the deepest node in the tree. A (rooted) tree with only one node (the root) has 
a height of zero. In the previous example, height of B is 2 (B — F — ]). 

• Height of the tree is the maximum height among all the nodes in the tree and depth of the tree is the 
maximum depth among all the nodes in the tree. For a given tree depth and height returns the same value. But for 
individual nodes we may get different results. 

• Size of a node is the number of descendants it has including itself (size of the subtree C is 3). 

• If every node in a tree has only one child (except leaf nodes) then we call such trees as skew trees. If every node 
has only left child then we call them as left skew trees. Similarly, if every node has only right child then we call 
them as right skew trees. 



6.3 Binary Trees 

A tree is called binary tree if each node has zero child, one child or two children. Empty tree is also a valid binary 
tree. We can visualize a binary tree as consisting of a root and two disjoint binary trees, called the left and right 
subtrees of the root. 



6.4 Types of Binary Trees 

Strict Binary Tree: A binary tree is called strict binary tree if each node has exactly two children or no children. 
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Full Binary Tree: A binary tree is called full binary tree if each node has exactly two children and all leaf nodes are 
at same level. 

root 


Complete Binary Tree: Before defining the complete binary tree, let us assume that the height of the binary tree is h. 
In complete binary trees, if we give numbering for the nodes by starting at root (let us say the root node has 1) then 
we get a complete sequence from 1 to number of nodes in the tree. While traversing we should give numbering for 
NULL pointers also. A binary tree is called complete binary tree if all leaf nodes are at height h or h — 1 and also 
without any missing number in the sequence. 

root 


6.5 Properties of Binary Trees 

For the following properties, let us assume that the height of the tree is h- Also, assume that root node is at height zero. 

Height Number of nodes at level h 

h = 0 2° = 1 


h= 1 2 1 = 2 
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From the diagram we can infer the following properties: 

• The number of nodes n in a full binary tree is 2 h+1 — 1. Since, there are h levels we need to add all nodes at 
each level [2° 4- 2 X + 2 2 + •••+ 2 h = 2 h+1 - 1], 

• The number of nodes n in a complete binary tree is between 2 h (minimum) and 2 h+1 — 1 (maximum). For 
more information on this, refer Priority Queues chapter. 

• The number of leaf nodes in a full binary tree are 2 h . 

• The number of NULL links (wasted pointers) in a complete binary tree of n nodes are n + 1. 

Structure of Binary Trees 

Now let us define structure of the binary tree. For simplicity, assume that the data of the nodes are integers. One way 
to represent a node (which contains the data) is to have two links which points to left and right children along with 
data fields as shown below: 



struct BinaryTreeNode { 
int data; 

struct BinaryTreeNode ‘left; 
struct BinaryTreeNode ‘right; 

}; 

Note: In trees, the default flow is from parent to children and showing directed branches is not compulsory. For our 
discussion, we assume both the below representations are same. 



Operations on Binary Trees 

Basic Operations 

• Inserting an element in to a tree 

• Deleting an element from a tree 

• Searching for an element 

• Traversing the tree 

Auxiliary Operations 

• Finding size of the tree 

• Finding the height of the tree 

• Finding the level which has maximum sum 

• Finding least common ancestor (LCA) for a given pair of nodes and many more. 

Applications of Binary Trees 

Following are the some of the applications where binary trees play important role: 

• Expression trees are used in compilers. 

• Huffman coding trees which are used in data compression algorithms. 

• Binary Search Tree (BST), which supports search, insertion and deletion on a collection of items in O(logri) 
(average). 

• Priority Queues (PQ), which supports search and deletion of minimum(or maximum) on a collection of items 
in logarithmic time (in worst case), 
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6.6 Binary Tree Traversals 

In order to process trees, we need a mechanism for traversing them and that forms the subject of this section. The 
process of visiting all nodes of a tree is called tree traversal. Each of the nodes is processed only once but they may 
be visited more than once. As we have already seen that in linear data structures (like linked lists, stacks, queues, 
etc...), the elements are visited in sequential order. But, in tree structures there are many different ways. 

Tree traversal is like searching the tree except that in traversal the goal is to move through the tree in some particular 
order. In addition, all nodes are processed in the traversal but searching stops when the required node is found. 

Traversal Possibilities 

Starting at the root of a binary tree, there are three main steps that can be performed and the order in which they are 
performed defines the traversal type. These steps are; performing an action on the current node (referred to as 
"visiting" the node and denotes with “D”), traversing to the left child node (denotes with “L”), and traversing to the 
right child node (denotes with “R”). This process can be easily described through recursion. Based on the above 
definition there are 6 possibilities: 

1. LDR: Process left subtree, process the current node data and then process right subtree 

2. LRD: Process left subtree, process right subtree and then process the current node data 

3. DLR: Process the current node data, process left subtree and then process right subtree 

4. DRL: Process the current node data, process right subtree and then process left subtree 

5. RDL: Process right subtree, process the current node data and then process left subtree 

6. RLD: Process right subtree, process left subtree and then process the current node data 

Classifying the Traversals 

The sequence in which these entities processed defines a particular traversal method. The classification based on the 
order in which current node is processed. That means, if we are classifying based on current node (D) and if D comes 
in the middle then it does not matter whether L on left side of D or R is on left side of D. Similarly, it does not matter 
whether L is on right side of D or R is on right side of D. Due to this, the total 6 possibilities were reduced to 3 and 
they are: 

• Preorder (DLR) Traversal 

• Inorder (LDR) Traversal 

• Postorder (LRD) Traversal 

There is another traversal method which does not depend on above orders and it is: 

• Level Order Traversal: This method is inspired from Breadth First Traversal (BFS of Graph algorithms). 

Let us use the below diagram for remaining discussion. 



PreOrder Traversal 

In pre-order traversal, each node is processed before (pre) either of its sub-trees. This is the simplest traversal to 
understand. However, even though each node is processed before the subtrees, it still requires that some information 
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must be maintained while moving down the tree. In the example above, the 1 is processed first, then the left sub-tree 
followed by the right subtree. Therefore, processing must return to the right sub-tree after finishing the processing of 
the left subtree. To move to right subtree after processing left subtree, we must maintain the root information. The 
obvious ADT for such information is a stack. Because of its LIFO structure, it is possible to get the information about 
the right subtrees back in the reverse order. 

Preorder traversal is defined as follows: 

• Visit the root. 

• Traverse the left subtree in Preorder. 

• Traverse the right subtree in Preorder. 

The nodes of tree would be visited in the order: 1 2 4 5 3 6 7 

void PreOrder(struct BinaryTreeNode *root){ 
if(root) { 

printf(“%d”,root—+data); 

PreOrder(root—deft); 

PreOrder (root—>right); 

} 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Non-Recursive Preorder Traversal 

In recursive version a stack is required as we need to remember the current node so that after completing the left 
subtree we can go to right subtree. To simulate the same, first we process the current node and before going to left 
subtree, we store the current node on stack. After completing the left subtree processing, pop the element and go to its 
right subtree. Continue this process until stack is nonempty. 

void PreOrderNonRecursive(struct BinaryTreeNode *root){ 
struct Stack *S = CreateStackQ; 
while(l) { 

while(root) { 

//Process current node 
printf(“%d”,root—>data); 

Push(S,root); 

//If left subtree exists, add to stack 
root = root—>left; 

} 

if(IsEmptyStack(S)) 
break; 
root = Pop(S); 

//Indicates completion of left subtree and current node, now go to right subtree 
root = root—bright; 

I 

DeleteStack(S); 

} 

Time Complexity: 0(n). Space Complexity: O(n). 

InOrder Traversal 

In Inorder traversal the root is visited between the subtrees. Inorder traversal is defined as follows: 

• Traverse the left subtree in Inorder. 

• Visit the root. 
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• Traverse the right subtree in Inorder. 

The nodes of tree would be visited in the order: 4 2 5 1 6 3 7 

void InOrder(struct BinaryTreeNode *root){ 
if(root) { 

InOrder(root—deft); 
printf(“%d”, root—> data); 

InOrder(root—s-right); 

3 

] 

Time Complexity: O(n). Space Complexity: O(n). 

Non-Recursive Inorder Traversal 

Non-recursive version of Inorder traversal is very much similar to Preorder. The only change is, instead of processing 
the node before going to left subtree, process it after popping (which indicates after completion of left subtree 
processing). 

void InOrderNonRecursive(struct BinaryTreeNode *root){ 
struct Stack *S = CreateStack(); 
while(l) { 

while(root) { 

Push(S,root); 

//Got left subtree and keep on adding to stack 
root = root—deft; 

} 

if(IsEmptyStack(S)) 
break; 
root = Pop(S); 

printf(“%d”, root—>data); //After popping, process the current node 

//Indicates completion of left subtree and current node, now go to right subtree 
root = root—bright; 

} 

DeleteStack(S); 

1 

Time Complexity: O(n). Space Complexity: O(n). 

PostOrder Traversal 

In postorder traversal, the root is visited after both subtrees. Postorder traversal is defined as follows: 

• Traverse the left subtree in Postorder. 

• Traverse the right subtree in Postorder. 

• Visit the root. 

The nodes of tree would be visited in the order: 4 5 2 6 7 3 1 

void PostOrder(struct BinaryTreeNode *root){ 
if(root) { 

PostOrder(root—deft); 

PostOrder(root—>right); 
printf(“°/od”,root—»data); 

] 

} 
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Time Complexity: O(n). Space Complexity: O(n). 

Non-Recursive Postorder Traversal 

In preorder and inorder traversals, after poping the stack element we do not need to visit the same vertex again. But in 
postorder traversal, each node is visited twice. That means, after processing left subtree we will be visiting the current 
node and also after processing the right subtree we will be visiting the same current node. But we should be processing 
the node during the second visit. Here the problem is how to differentiate whether we are returning from left subtree 
or right subtree? 

Trick for this problem is: after popping an element from stack, check whether that element and right of top of the 
stack are same or not. If they are same then we are done with processing of left subtree and right subtree. In this case 
we just need to pop the stack one more time and print its data. 

void PostOrderNonRecursive(struct BinaryTreeNode *root){ 
struct Stack *S = CreateStack(); 
while (1) { 

if (root) { 

Push(S,root); 
root=root—deft; 

] 

else { if(IsEmptyStack(S)) { 

printf("Stack is Empty"); 
return; 

} 

else if(Top(S)—bright == NULL) { 

root = Pop(S); 
prin tf(“%d”, root —* data); 
if(root == Top(S)—aright) { 

printf(“%d” ,Top(S) —»data); 

Pop(S); 

} 

} 

if(!IsEmptyStack(S)) 

root=T op(S)—> right; 
else root=NULL; 

} 

) 

DeleteStack(S); 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Level Order Traversal 

Level order traversal is defined as follows: 

• Visit the root. 

• While traversing level l, keep all the elements at level l + 1 in queue. 

• Go to the next level and visit all the nodes at that level. 

• Repeat this until all levels are completed. 

The nodes of tree would be visited in the order: 1 2 3 4 5 6 7 
void LevelOrder(struct BinaryTreeNode *root){ 

struct BinaryTreeNode *temp;___ 


6.6 Binary Tree Traversals 


114 



Data Structures and Algorithms Made Easy 


Trees 


struct Queue *Q_= CreateQueue(); 
if(!root) 

return; 

EnQueue(Qjroot); 
while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 

//Process current node 
printf(“%d”, temp—>data); 
if(temp—deft) 

EnQueue(Q, temp—deft); 
if(temp—aright) 

EnQueue(Q, temp—bright); 

} 

DeleteQueue(Q); 

} 

Time Complexity: O(n). Space Complexity: O(n). Since, in the worst case, all the nodes on the entire last level could 
be in the queue simultaneously. 

Problems on Binary Trees 

Problem-1 Give an algorithm for finding maximum element in binary tree. 

Solution: One simple way of solving this problem is: find the maximum element in left subtree, find maximum 
element in right sub tree, compare them with root data and select the one which is giving the maximum value. This 
approach can be easily implemented with recursion. 

int FindMax(stract BinaryTreeNode ’root) { 

int root_val, left, right, max = INT_MIN; 
if(root !=NULL) { 

root_val = root—>data; 
left = FindMax(root—deft); 
right = FindMax(root—aright); 

// Find the largest of the three values. 
if(left > right) 

max = left; 
else max = right; 
if(root_val > max) 

max = root_val; 

) 

return max; 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-2 Give an algorithm for finding maximum element in binary tree without recursion. 

Solution: Using level order traversal: just observe the elements data while deleting. 

int FindMaxUsingLevelOrder(struct BinaryTreeNode *root){ 
struct BinaryTreeNode ’temp; 
int max = INT_MIN; 
struct Queue *Q_= CreateQueue(); 

EnQueue(Q,root); 
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while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 

// largest of the three values 
if(max < temp—»data) 

max= temp—>data; 
if(temp—deft) 

EnQueue (Q, temp—deft); 
if(temp—aright) 

EnQueue (Q, temp—bright); 

1 

DeleteQueue(Q); 
return max; 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-3 Give an algorithm for searching an element in binary tree. 

Solution: Given a binary tree, return true if a node with the data is found in the tree. Recurse down the tree, choose 
the left or right branch by comparing the data with each nodes data. 

int FindInBinaryTreeUsingRecursion(struct BinaryTreeNode ’root, int data) { 
int temp; 

// Base case == empty tree, in that case, the data is not found so return false 
if(root == NULL) 
return 0; 

else { //see if found here 

if(data == root—>data) 
return 1; 

else { // otherwise recur down the correct subtree 

temp = FindlnBinaryTreeUsingRecursion (root—deft, data) 
if(temp != 0) 

return temp; 

else retum(FindInBinaryTreeUsingRecursion(root—bright, data)); 

} 

1 

return 0; 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-4 Give an algorithm for searching an element in binary tree without recursion. 

Solution: We can use level order traversal for solving this problem. The only change required in level order traversal 
is, instead of printing the data we just need to check whether the root data is equal to the element we want to search. 

int SearchUsingLevelOrder(struct BinaryTreeNode ‘root, int data){ 
struct BinaryTreeNode ‘temp; 
struct Queue *Q; 
if(!root) 

return -1; 

Q_= CreateQueueQ; 

EnQueue(Q,root); 
while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 
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//see if found here 
if(data == root—>data) 
return 1; 
if(temp—deft) 

EnQueue (Q, temp—deft); 
if(temp—bright) 

EnQueue (Q, temp—bright); 

3 

DeleteQueue(Q); 
return 0; 

3 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-5 Give an algorithm for inserting an element into binary tree. 

Solution: Since the given tree is a binary tree, we can insert the element wherever we want. To insert an element, we 
can use the level order traversal and insert the element wherever we found the node whose left or right child is NULL. 

void InsertInBinaryTree(struct BinaryTreeNode *root, int data){ 
struct Queue *Qj 
struct BinaryTreeNode *temp; 
struct BinaryTreeNode "newNode; 

newNode = (struct BinaryTreeNode *) malloc(sizeof(struct BinaryTreeNode)); 
newNode—deft = newNode—aright = NULL; 
if(!newNode) { 

printf(“Memory Error”); 
return; 

} 

if(!root) { 

root = newNode; 
return; 

} 

Q.= CreateQueue(); 

EnQueue(Q,root); 
while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 
if(temp—deft) 

EnQueue(Q, temp—deft); 
else { temp—deft=newNode; 

DeleteQueue(Q); 
return; 

} 

if(temp—aright) 

EnQueue(Qj temp—bright); 
else { temp—>right=newNode; 

DeleteQueue(Q); 
return; 

} 

] 

DeleteQueue(Q); 

3 

Time Complexity: O(n). Space Complexity: O(n)._ 
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Problem-6 Give an algorithm for finding the size of binary tree. 

Solution: Calculate the size of left and right subtrees recursively, add 1 (current node) and return to its parent. 

// Compute the number of nodes in a tree, 
int SizeOfBinaryTree(struct BinaryTreeNode "root) { 
if(root==NULL) 
return 0; 

else retum(SizeOfBinaryTree(root—deft) + 1 + SizeOfB inary Tree (root—bright)); 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-7 Can we solve the Problem-6 without recursion? 

Solution: Yes, using level order traversal. 

int SizeofBTUsingLevelOrder(struct BinaryTreeNode *root){ 
struct BinaryTreeNode *temp; 
struct Queue *Q; 
int count = 0; 
if(!root) return 0; 

Q_= CreateQueue(); 

EnQueue(Q,root); 
while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 
count++; 
if(temp—>left) 

EnQueue (Q, temp—deft); 
if(temp—>right) 

EnQueue (Q, temp^>right); 

} 

DeleteQueue (Q); 
return count; 

} 

Time Complexity: O (n). Space Complexity: O(n). 

Problem-8 Give an algorithm for printing the level order data in reserve order. For example, the output for the 
below tree should be: 4 5 6 7 2 3 1 

root 


Solution: 

void LevelOrderTraversalInReverse(struct BinaryTreeNode *root){ 
struct Queue *Q; 
struct Stack *s = CreateStackQ; 
struct BinaryTreeNode ‘temp; 
if(!root) return; 

Q_= CreateQueue(); 

EnQueue (Qj root)]__ 
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while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 
if(temp—>right) 

EnQueue(Q, temp—bright); 
if(temp-»left) 

EnQueue (Q, temp—deft); 

Push(s, temp); 

3 

while(!IsEmptyStack(s)) 

printf(“%d”,Pop(s)—>data); 

} 

Time Complexity; O(n). Space Complexity; O(n). 

Problem-9 Give an algorithm for deleting the tree. 

Solution: To delete a tree we must traverse all the nodes of the tree and delete them one by one. So which traversal we 
should use Inorder, Preorder, Postorder or Level order Traversal? 

Before deleting the parent node we should delete its children nodes first. We can use postorder traversal as it does the 
work without storing anything. We can delete tree with other traversals also with extra space complexity. For the 
following tree nodes are deleted in order - 4,5,2,3,1. 

root 


void DeleteBinaryTree(struct BinaryTreeNode *root){ 
if(root == NULL) 
return; 

!* first delete both subtrees V 
DeleteBinaryTree (root—deft); 

DeleteBinaryTree(root—> right); 

//Delete current node only after deleting subtrees 
free(root); 

} 

Time Complexity; O(n). Space Complexity: O(n). 

Problem-10 Give an algorithm for finding the height (or depth) of the binary tree. 

Solution: Recursively calculate height of left and right subtrees of a node and assign height to the node as max of the 
heights of two children plus 1. This is similar to PreOrder tree traversal (and DFS of Graph algorithms). 

int HeightOfBinaryTree(struct Binary TreeNode *root){ 
int leftheight, rightheight; 
if(root == NULL) 
return 0; 

else { /* compute the depth of each subtree */ 
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leftheight = HeightOfBinaryTree(root—deft); 
rightheight = HeightOfBinaryTree(root—bright); 
if(leftheight > rightheight) 

retum(leftheight + 1); 
else return(rightheight + 1); 


} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-11 Can we solve the Problem-10 without recursion? 

Solution: Yes. Using level order traversal. This is similar to BFS of Graph algorithms. End of level is identified with 
NULL. 

int FindHeightofBinaryTree(struct BinaryTreeNode *root){ 
int level=l; 
struct Queue *Q; 
if(!root) return 0; 

Q_= CreateQueue(); 

EnQueue(Q,root); 

// End of first level 
EnQueue(Q,NULL); 
while(!IsEmptyQueue(Q)) { 
root=DeQueue(Q); 

// Completion of current level. 
if(root==NULL) { 

//Put another marker for next level. 
if(!IsEmptyQueue(Q)) 

EnQueue(Q,NULL); 

level++; 

3 

else { if(root—>left) 

EnQueue(Q, root—deft); 
if(root—bright) 

EnQueue(Q, root—bright); 

1 

} 

return level; 

1 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-12 Give an algorithm for finding the deepest node of the binary tree. 

Solution: 

struct BinaryTreeNode *DeepestNodeinBinaryTree(struct BinaryTreeNode *root){ 
struct BinaryTreeNode *temp; 
struct Queue *Q; 
if(!root) return NULL; 

Q_= CreateQueue(); 

EnQueue(Qroot); 
while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 

if(temp—deft) __ 
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EnQueuelQ, temp—deft); 
if(temp—night) 

EnQueue(Q, temp—bright); 

1 

DeleteQueue(Q); 
return temp; 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-13 Give an algorithm for deleting an element from binary tree. 

Solution: The deletion of a node in binary tree can be implemented as 

• Find the node which we want to delete. 

• Find the deepest node in the tree. 

• Replace the deepest nodes data with node to be deleted. 

• Then delete the deepest node. 

Problem-14 Give an algorithm for finding the number of leaves in the binary tree without using recursion. 

Solution: The set of all nodes whose both left and either right are NULL are called leaf nodes, 
void NumberOfLeavesInBTusingLevelOrder(struct BinaryTreeNode *root){ 
struct BinaryTreeNode ‘temp; 
struct Queue ‘Qj 
int count = 0; 
if(!root) return 0; 

Q.= CreateQueue(); 

EnQueuelQ^root); 
while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 
if(!temp—>left && Itemp—night) 
eount++; 

else { if(temp—>left) 

EnQueue(Q, temp-»left); 
if(temp—night) 

EnQueue(Q, temp—night); 

} 

} 

DeleteQueue(Q); 
return count; 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-15 Give an algorithm for finding the number of full nodes in the binary tree without using recursion. 

Solution: The set of all nodes with both left and right children are called full nodes, 
void NumberOfFullNodesInBTusingLevelOrder(stract BinaryTreeNode *root){ 
struct BinaryTreeNode ‘temp; 
struct Queue *Q; 
int count = 0; 
if(!root) return 0; 

Q.= CreateQueue(); 

EnQueue (Q,r oot); 
while(!IsEmptyQueue(Q)) ( 
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temp = DeQueue(Q); 
if(temp—>left && temp—bright) 
count++; 
if(temp—deft) 

EnQueue (Q, temp—deft); 
if (temp-aright) 

EnQueue (Q, temp-bright); 

} 

DeleteQueue(Q); 
return count; 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-16 Give an algorithm for finding the number of half nodes (nodes with only one child) in the binary tree 
without using recursion. 

Solution: The set of all nodes with either left or either right child (but not both) are called half nodes, 
void NumberOfHalfNodesInBTusingLevelOrder(struct BinaryTreeNode *root){ 
struct BinaryTreeNode "temp; 
struct Queue *Q; 
int count = 0; 
if(!root) 

return 0; 

Q_= CreateQueueQ; 

EnQueue(Q,root); 
while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 

//we can use this condition also instead of two temp—deft A temp—>right 
if(!temp—deft && temp—bright || temp—►left && !temp-bright) 
count++; 
if(temp—►left) 

EnQueue (Q, temp—►left); 
if(temp—>right) 

EnQueue (Q, temp-bright); 

} 

DeleteQueue(Q); 
return count; 

3 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-17 Given two binary trees, return true if they are structurally identical. 

Solution: 

Algorithm: 

• If both trees are NULL then return true. 

• If both trees are not NULL, then compare data and recursively check left and right subtree structures. 
//Return true if they are structurally identical. 

int AreStructurullySameTrees(struct BinaryTreeNode *rootl, struct BinaryTreeNode *root2) { 

// both empty—>1 

if(rootl==NULL && root2==NULL) 
return 1; 

if(rootl==NULL || root2==NULL) __ 


6.6 Binary Tree Traversals 


122 



Data Structures and Algorithms Made Easy 


Trees 


return 0; 

// both non-empty—^-compare them 

return(rootl—>data == root2—*data && AreStmcturaIlySameTrees(rootl—*-left, root2—>left) && 
AreStructurullySameTrees(rootl -aright, root2-+right)); 

) 

Time Complexity: O(n). Space Complexity: O(n), for recursive stack. 

Problem-18 Give an algorithm for finding the diameter of the binary tree. The diameter of a tree (sometimes 
called the width) is the number of nodes on the longest path between two leaves in the tree. 

Solution: To find the diameter of a tree, first calculate the diameter of left subtree and right sub trees recursively. 
Among these two values, we need to send maximum along with current level (+1). 

int DiameterOfTree(struct BinaryTreeNode ‘root, int *ptr){ 
int left, right; 
if(!root) 

return 0; 

left = DiameterOfTree(root—>left, ptr); 
right = DiameterOfTree(root—bright, ptr); 
if(left + right > *ptr) 

*ptr = left + right; 
return Max(left, right)+l; 

} 

Time Complexity: O(n). Space Complexity: 0(n). 

Problem-19 Give an algorithm for finding the level which is having maximum sum in the binary tree. 

Solution: The logic is very much similar to finding number of levels. The only change is, we need to keep track of sums 
as well. 

int FindLevelwithMaxSum(struct BinaryTreeNode *root){ 
struct BinaryTreeNode ‘temp; 
int level=0, maxLevel=0; 
struct Queue *Q; 
int currentSum = 0, maxSum = 0; 
if(!root) return 0; 

Q.=CreateQueue(); 

EnQueue(Qjroot); 

EnQueue(Q,NULL); //End of first level. 

while(!IsEmptyQueue(Q)) { 
temp =DeQueue(Q); 

// If the current level is completed then compare sums 
if(temp == NULL) { 

if(currentSum> maxSum) { 

maxSum = currentSum; 
maxLevel = level; 

1 

currentSum = 0; 

//place the indicator for end of next level at the end of queue 
if(!IsEmptyQueue(Q)) 

EnQueue(Q,NULL); 

level++; 

} 
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else { currentSum += temp—»data; 
if(temp-deft) 

EnQueue(temp, temp—deft); 
if(root—>right) 

EnQueue(temp, temp-bright); 

} 

1 

return maxLevel; 

1 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-20 Given a binary tree, print out all of its root-to-leaf paths. 

Solution: Refer comments in functions. 

void PrintPathsRecur(struct BinaryTreeNode ‘root, int path[], int pathLen) { 
if(root ==NULL) 
return; 

// append this node to the path array 
path[pathLen] = root—>data; 
pathLen++; 

// it's a leaf, so print the path that led to here 
if(root—deft==NULL && root—>right==NULL) 

PrintAxray(path, pathLen); 
else { // otherwise try both subtrees 

PrintPathsRecur(root—deft, path, pathLen); 

PrintPathsRecur(root—aright, path, pathLen); 

1 

1 

// Function that prints out an array on a line, 
void Print Array (int ints[], int len) { 
for (int i=0; iclen; i++) 

printf(“%d”,ints[i]); 

1 

Time Complexity: O(n). Space Complexity: 0(n), for recursive stack. 

Problem-21 Give an algorithm for checking the existence of path with given sum. That means, given a sum check 
whether there exists a path from root to any of the nodes. 

Solution: For this problem, the strategy is: subtract the node value from the sum before calling its children recursively, 
and check to see if the sum is 0 when we run out of tree. 

int HasPathSum(struct BinaryTreeNode * root, int sum) { 

// return true if we run out of tree and sum==0 
if(root == NULL) return (sum == 0); 
else { // otherwise check both subtrees 

int remainingSum = sum - root—>data; 

if((root—deft && root—>right)||(!root—deft && !root—»right)) 

retum(HasPathSum(root—deft, remainingSum) || HasPathSum(root—right, remainingSum)); 
else if(root—deft) 

return HasPathSum(root—deft, remainingSum); 
else return HasPathSum(root—>right, remainingSum); 

] 
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} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-22 Give an algorithm for finding the sum of all elements in binary tree. 

Solution: Recursively, call left subtree sum, right subtree sum and add their values to current nodes data. 

int Add(struct BinaryTreeNode ‘root) { 
if(root == NULL) return 0; 

else return (root—>data + Add(root—deft) + Add(root—>right)); 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-23 Can we solve Problem-22 without recursion? 

Solution: We can use level order traversal with simple change. Every time after deleting an element from queue, add 
the nodes data value to sum variable. 

int Sumo£BTusingLevelOrder(struct BinaryTreeNode *root){ 
struct BinaryTreeNode ‘temp; 
struct Queue *Q; 
int sum = 0; 
if(!root) return 0; 

Q.= CreateQueueQ; 

EnQueue(Q,root); 
while(!IsEmptyQueue(Q)) { 
temp = DeQueue(Q); 
sum += temp—>data; 
if(temp—deft) 

EnQueue (Q, temp—deft); 
if(temp—►right) 

EnQueue (Q, temp—bright); 

3 

DeleteQueue(Q); 
return sum; 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-24 Give an algorithm for converting a tree to its mirror. Mirror of a tree is another tree with left and 
right children of all non-leaf nodes interchanged. 



struct BinaryTreeNode *MirrorOfBinaryTree(struct BinaryTreeNode *root){ 
struct BinaryTreeNode * temp; 
if(root) { 
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MirrorOfBinaryTree(root—deft); 

MirrorOfBinaryTree(root—>right); 

/* swap the pointers in this node 7 
temp = root—deft; 
root—deft = root—>right; 
root—>right = temp; 

} 

return root; 

} 

Time Complexity: O(n). Space Complexity: 0(n). 

Problem-25 Given two trees, give an algorithm for checking whether they are mirrors of each other. 

Solution: 

int AreMirrors(struct BinaryTreeNode * rootl, struct BinaryTreeNode * root2) { 
if(rootl == NULL && root2 == NULL) return 1; 
if(rootl == NULL || root2 == NULL) return 0; 
if(rootl—»data != root2—>data) return 0; 

else return AreMirrors(rootl-deft, root2->right) && AreMirrors(rootl->right, root2—deft); 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-26 Give an algorithm for finding LCA (Least Common Ancestor) of two nodes in a Binary Tree. 

Solution: 

struct BinaryTreeNode *LCA(struct BinaryTreeNode ‘root, struct BinaryTreeNode ‘a, struct BinaryTreeNode *(3){ 
struct BinaryTreeNode ‘left, ‘right; 
if(root == NULL) return root; 
if(root == a \ | root == (3) return root; 
left = LCA (root—deft, a, (3); 
right = LCA (root—bright, a, P); 
if(left && right) 

return root; 

else return (left? left: right) 

1 

Time Complexity: 0(n). Space Complexity: O(n) for recursion. 

Problem-27 Give an algorithm for constructing binary tree from given Inorder and Preorder traversals. 

Solution: Let us consider the below traversals: 

Inorder sequence: D B E A F C 
Preorder sequence: A B D E C F 

In a Preorder sequence, leftmost element denotes the root of the tree. So we know ‘A 1 is root for given sequences. By 
searching ‘A’ in Inorder sequence we can find out all elements on left side of ‘A’ which come under left subtree and 
elements right side of ‘A’ which come under right subtree. So we get the below structure. 
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We recursively follow above steps and get the following tree. 

root 


Algorithm: BuildTreeQ 

1 Select an element from Preorder. Increment a Preorder index variable (preindex in below code) to pick next 
element in next recursive call. 

2 Create a new tree node (newNode) with the data as selected element. 

3 Find the selected elements index in Inorder. Let the index be inindex. 

4 Call BuildBinaryTree for elements before inindex and make the built tree as left subtree of newNode. 

5 Call BuildBinaryTree for elements after inindex and make the built tree as right subtree of newNode. 

6 return newNode. 

struct BinaryTreeNode * BuildBinaryTree(int inOrder[], int preOrder[], int inStrt, int inEnd){ 
static int preindex = 0; 
struct BinaryTreeNode ‘newNode 
if(inStrt > inEnd) return NULL; 

newNode = (struct BinaryTreeNode *) malloc (sizeof(struct BinaryTreeNode)); 
if(!newNode) { 

printf(“Memory Error”); 
return; 

} 

// Select current node from Preorder traversal using preindex 

newNode—>data = preOrder[preIndex]; 

prelndex++; 

if(inStrt == inEnd) /* if this node has no children then return V 

return newNode; 

/* else find the index of this node in Inorder traversal 7 
int inindex = Search(inOrder, inStrt, inEnd, newNode—>data); 

/* Using index in Inorder traversal, construct left and right subtress 7 
newNode—►left = BuildBinaryTree(inOrder, preOrder, inStrt, inlndex-l); 
newNode—>■ right = BuildBinaryTree(inOrder, preOrder, inlndex+l, inEnd); 
return newNode; 

} 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-28 If we are given two traversal sequences, can we construct the binary tree uniquely? 

Solution: It depends on what traversals are given. If one of the traversal methods is Inorder then the tree can be 
constructed uniquely, otherwise not. 

Therefore, following combination can uniquely identify a tree: 

• Inorder and Preorder 

• Inorder and Postorder 

• Inorder and Level-order 
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The following combinations do not uniquely identify a tree. 

• Postorder and Preorder 

• Preorder and Level-order 

• Postorder and Level-order 


For example, Preorder, Level-order and Postorder traversals are same for above trees: 



Preorder Traversal = AB Postorder Traversal = BA Level-order Traversal = AB 


So, even if three of them (PreOrder, Level-Order and PostOrder) are given, tree cannot be constructed uniquely. 


Problem-29 Give an algorithm for printing all the ancestors of a node in a Binary tree. For the below tree, for 7 
the ancestors are 13 7. 



Solution: Apart from the Depth First Search of this tree, we can use the following recursive way to print the ancestors. 

int PrintAllAncestors(struct BinaryTreeNode ‘root, struct BinaryTreeNode *node){ 
if (root == NULL) return 0; 

if(root—>left == node || root—>right == node || PrintAllAncestors(root—deft, node) || 

PrintAllAncestors(root—bright, node)) { 
printf(“%d”,root—>data); 
return 1; 


3 

return 0; 


} 

Time Complexity: O(n). Space Complexity: O(n) for recursion. 


Problem-30 Zigzag Tree Traversal: Give an algorithm to traverse a binary tree in Zigzag order. For example, the 
output for the below tree should be: 1 3 2 4 5 6 7 



Solution: This problem can be solved easily using two stacks. Assume the two stacks are: currentLevel and nextLevel. 
We would also need a variable to keep track of the current level order (whether it is left to right or right or left). 
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We pop from currentLevel stack and print the nodes value. Whenever the current level order is from left to right, 
push the nodes left child, then its right child to stack nextLevel. Since a stack is a Last In First OUT (LIFO) structure, 
next time when nodes are popped off nextLevel, it will be in the reverse order. On the other hand, when the current 
level order is from right to left, we would push the nodes right child first, then its left child. Finally, don't forget to 
swap those two stacks at the end of each level (t. e., when currentLevel is empty). 

void ZigZagTraversal(struct BinaryTreeNode *root){ 
struct BinaryTreeNode *temp; 
int leftToRight = 1; 
if(!root) return; 

struct Stack 'currentLevel = CreateStack(), 'nextLevel = CreateStack(); 

Push(currentLevel, root); 
while(!IsEmptyStack(currentLevel)) { 
temp = Pop(currentLevel); 
if(temp) { 

printf(“%d”,temp—»data); 
if(leftToRight) { 

if(temp—deft) Push (nextLevel, temp—deft); 
if(temp—bright) Push(nextLevel, temp—>right); 

} 

else { if(temp—bright) Push(nextLevel, temp—bright); 

if(temp—deft) Push(nextLevel, temp—deft); 

} 

} 

if(IsEmptyStack(currentLevel)) { 

leftToRight = 1-leftToRight; 
swap (currentLevel, nextLevel); 

} 

] 

] 

Time Complexity: O(n). Space Complexity: Space for two stacks = O(n) + O(n) = 0(n). 

Problem-31 Give an algorithm for finding the vertical sum of a binary tree. For example, 

The tree has 5 vertical lines 

Vertical-1: nodes-4 => vertical sum is 4 

Vertical-2: nodes-2 => vertical sum is 2 

Vertical-3: nodes-1,5,6 => vertical sum is 1 + 5 + 6 = 12 

Vertical-4: nodes-3 => vertical sum is 3 

Vertical-5: nodes-7 => vertical sum is 7 

We need to output: 4 2 12 3 7 



Solution: We can do an inorder traversal and hash the column. We call VerticalSumInBinaryTree(root, 0) which 
means the root is at column 0. While doing the traversal, hash the column and increase its value by root -> data. 
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void VerticalSuminBinaryTree (struct BinaryTreeNode *root, int column)! 
if(root==NULL) return; 

VerticalSumlnBinaryTree(root—deft, column-1); 

//Refer Hashing chapter for implementation of hash table 
Hash[column] += root—>data; 

VerticalSumlnBinaryTree(root—>• right, column+1); 

} 

VerticalSumInBinaryTree(root, 0); 

Print Hash; 


Problem-32 How many dif ferent binary trees are possible with n nodes? 


Solution: For example, consider a tree with 3 nodes (n = 3), it will have the maximum combination of 5 different (i.e., 
2 3 — 3 = 5) trees. 





In general, if there are n nodes, there exist 2 71 — n different trees. 

Problem-33 Given a tree with a special property where leaves are represented with ‘L’ and internal node with T. 
Also, assume that each node has either 0 or 2 children. Given preorder traversal of this tree, construct the tree [1]. 
Example: Given preorder string => ILILL 

root 


Solution: First, we should see how preorder traversal is arranged. Pre-order traversal means first put root node, then 
pre-order traversal of left subtree and then pre-order traversal of right subtree. In normal scenario, it’s not possible to 
detect where left subtree ends and right subtree starts using only pre-order traversal. Since every node has either 2 
children or no child, we can surely say that if a node exists then its sibling also exists. So every time we are computing 
a subtree, we need to compute its sibling subtree as well. 



Secondly, whenever we get ‘L’ in the input string, that is a leaf and we can stop for a particular subtree at that point. 
After this ‘L’ node (left child of its parent ‘L’), its sibling starts. If ‘L’ node is right child of its parent, then we need to 
go up in the hierarchy to find next subtree to compute. Keeping above invariant in mind, we can easily determine 
when a subtree ends and next start. It means that we can give any start node to our method and it can easily complete 
the subtree it generates going outside of its nodes. We just need to take care of passing correct start nodes to different 
sub-trees. 


struct BinaryTreeNode *BuildTreeFromPreOrder(char* A, int *i){ 
struct BinaryTreeNode ‘newNode; 

newNode = (struct BinaryTreeNode *) malloc(sizeof(struct BinaryTreeNode)); 
newNode—i-data = A[*i]; 
newNode—deft = newNode—aright = NULL; 
if(A == NULL){ //Boundary Condition 

free(newNode); 
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return NULL; 

1 

if(A[*i] == 'L') //On reaching leaf node, return 

return newNode; 

*i = *i + 1; //Populate left sub tree 

newNode—deft = BuildTreeFromPreOrder(A, i); 

*i = *i + 1; //Populate right sub tree 

newNode—aright = BuildTreeFromPreOrder(A, i); 
return newNode; 

} 

Time Complexity: O(n). 

Problem-34 Given a binary tree with three pointers (left, right and nextSibling), give an algorithm for filling the 
nextSibling pointers assuming they are NULL initially. 

Solution: We can use simple queue (similar to the solution of Problem-11). Let us assume that the structure of binary 
tree is: 

struct BinaryTreeNode { 

struct BinaryTreeNode* left; 
struct BinaryTreeNode* right; 
struct BinaryTreeNode* nextSibling; 

3 ; 

int FillNextSiblings(struct BinaryTreeNode *root){ 
struct BinaryTreeNode *temp; 
struct Queue *Q; 
if(!root) return 0; 

Q_= CreateQueueQ; 

EnQueue(Q,root); 

EnQueue(QjNULL); 
while(!IsEmptyQueue(Q)) { 
temp =DeQueue(Q); 

// Completion of current level. 

if(temp ==NULL) {//Put another marker for next level. 
if(!IsEmptyQueue(Q)) 

EnQueue(Q,NULL); 

} 

else { temp-»nextSibling = QueueFront(Q); 

if(root—deft) EnQueue(Q, temp—deft); 
if(root—aright) EnQueue(Q, temp-bright); 

} 

} 

3 

Time Complexity: O(n). Space Complexity: O(n). 

Problem-35 For Problem-34, is there any otherway of solving? 

Solution: The trick is to re-use the populated nextSibling pointers. As mentioned earlier, we just need one more step 
for it to work. Before we passed the left and right to the recursion function itself, we connect the right childs 
nextSibling to the current nodes nextSibling left child. In order for this to work, the current node nextSibling 
pointer must be populated, which is true in this case. 

void FillNextSiblings(struct BinaryTreeNode* root) { 
if (!root) return; 
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if (root—>left) root—>left—>nextSibling = root—bright; 
if (root—>right) 

root-»right—mextSibling = (root—mextSibling) ? root—mextSibling—»left: NULL; 
FillNextSiblings(root—deft); 

FillNextSiblings(root—>right); 

] 

Time Complexity; O(n). 

6.7 Generic Trees (N-ary Trees) 

In the previous section we have discussed binary trees where each node can have maximum of two children only and 
represented them easily with two pointers. But suppose if we have a tree with many children at every node and also if 
we do not know how many children a node can have, how do we represent them? For example, consider the tree 
shown above. 


For a tree like this, how do we represent the tree? 

In the above tree, there are nodes with 6 children, with 3 children, 2 children, with 1 child, and with zero children 
(leaves). To present this tree we have to consider the worst case (6 children) and allocate those many child pointers for 
each node. Based on this, the node representation can be given as: 
struct TreeNode{ 
int data; 

struct TreeNode ‘firstChild; 
struct TreeNode *secondChild; 
struct TreeNode ‘thirdChild; 
struct TreeNode ‘fourthChild; 
struct TreeNode *fifthChild; 
struct TreeNode *sixthChild; 

}; 

Since we are not using all the pointers in all the cases there is a lot of memory wastage. Also, another problem is that, 
in advance we do not know the number of children for each node. In order to solve this problem we need a 
representation that minimizes the wastage and also accept nodes with any number of children. 

Representation of Generic Trees 

Since our objective is to reach all nodes of the tree, a possible solution to this is as follows: 

• At each node link children of same parent (siblings) from left to right. 

• Remove the links from parent to all children except the first child. 
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What these above statements say is if we have a link between childrens then we do not need extra links from parent to 
all children. This is because we can traverse all the elements by starting at the first child of the parent. So if we have 
link between parent and first child and also links between all children of same parent then it solves our problem. This 
representation is sometimes called first child/next sibling representation. First child/next sibling representation of the 
generic tree is shown above. The actual representation for this tree is: 



Based on this discussion, the tree node declaration for general tree can be given as: 
struct TreeNode { 
int data; 

struct TreeNode *firstChild; 
struct TreeNode ‘nextSibling; 

}; 

Note: Since we are able to represent any generic tree with binary representation, in practice we use only binary tree. 

Problems on Generic Trees 

Problem-36 Given a tree, give an algorithm for finding the sum of all the elements of the tree. 

Solution: The solution is similar to what we have done for simple binary trees. That means, traverse the complete list 
and keep on adding the values. We can either use level order traversal or simple recursion, 
int FindSum(struct TreeNode *root){ 
if(!root) return 0; 

return root—»data + FindSum(root—»firstChild) + FindSum(root—^sibling); 

} 
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Time Complexity: O(n). Space Complexity: 0(1) (if we do not consider stack space), otherwise 0(n). 

Note: All problems which we have discussed for binary trees are applicable for generic trees also. Instead of left and 
right pointers we just need to use firstChild and nextSibling. 

Problem-37 For a 4-ary tree (each node can contain maximum of 4 children), what is the maximum possible 
height with 100 nodes? Assume height of a single node is 0. 

Solution: In 4-ary tree each node can contain 0 to 4 children and to get maximum height, we need to keep only one 
child for each parent. With 100 nodes the maximum possible height we can get is 99. If we have a restriction that at 
least one node is having 4 children, then we keep one node with 4 children and remaining all nodes with 1 child. In 
this case, the maximum possible height is 96. Similarly, with n nodes the maximum possible height is n - 4. 


Problem-38 For a 4-ary tree (each node can contain maximum of 4 children), what is the minimum possible 
height with n nodes? 

Solution: Similar to above discussion, if we want to get minimum height, then we need to fill all nodes with maximum 
children (in this case 4). Now let’s see the following table, which indicates the maximum number of nodes for a given 
height. 


Height, h 

Maximum Nodes at height, h = 4 h 

4 h+i_l 

Total Nodes height h- ^ 

0 

1 

1 

1 

4 

1+4 

2 

4x4 

1+4x4 

3 

4x4x4 

1+4x4 + 4x4x4 


For a given height h the maximum possible nodes are: • 


n = ■ 


, 4 ^+i = 3n + 1 => (h + l)log4 = log(3n + 1) => h + 1 = log 4 (3n + 1) => h = log 4 (3n + 1) - 1 


Problem-39 Given a parent array P, where P[i] indicates the parent of i th node in the tree (assume parent of root 
node is indicated with —1). Give an algorithm for finding the height or depth of the tree. 


Solution: 

For example: if the P is 


Its corresponding tree is: 


From the problem definition, the given array is representing the parent array. That means, we need to consider the 
tree for that array and find the depth of the tree. The depth of this given tree is 4. If we carefully observe, we just need 
to start at every node and keep going to its parent until we reach —1 and also keep track of the maximum depth among 
all nodes. 


-1 

0 

1 

6 

6 

0 

0 

2 

7 

0 

1 

2 

3 

4 

5 

6 

7 

8 



int FindDepthInGenericTree(int P[], int n){ 

int maxDepth =-l, currentDepth =-l, j; 
for (int i = 0; i < n; i++) { 
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currentDepth = 0; j = i; 
while(P[j] != -1) { 

currentDepth++; j = P[j]; 

} 

if(currentDepth > maxDepth) 

maxDepth = currentDepth; 

} 

return maxDepth; 

} 

Time Complexity; 0(n 2 ). For skew trees we will be re-calculating the same values. Space Complexity: 0(1). 

Note: We can optimize the code by storing the previous calculated nodes depth in some hash table or other array. 
This reduces the time complexity but uses extra space. 

Problem-40 Given a node in the generic tree, give an algorithm for counting the number of siblings for that node. 

Solution: Since tree is represented with first child/next sibling method, the tree structure can be given as: 
struct TreeNode{ 
int data; 

struct TreeNode ‘firstChild; 
struct TreeNode 'nextSibling; 

}; 

For a given node in the tree, we just need to traverse all its nextsiblings. 
int SiblingsCount(struct TreeNode *current){ 
int count = 0; 
while (current) { 
count++; 

current = current—►nextSibling; 

} 

reutm count; 

} 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-41 Given a node in the generic tree, give an algorithm for counting the number of children for that 
node. 

Solution: Since the tree is represented as first child/next sibling method, the tree structure can be given as: 
struct TreeNode{ 
int data; 

struct TreeNode ‘firstChild; 
struct TreeNode ‘nextSibling; 

}; 

For a given node in the tree, we just need to point to its first child and keep traversing all its nextsiblings. 
int ChildCount(struct TreeNode *current){ 
int count = 0; 

current = current—►firstChild; 
while(current) { 
count++; 

current = current—►nextSibling; 

) 

reutm count; 

} 
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Time Complexity: O(n). Space Complexity: 0(1). 

Problem-42 Given two trees how do we check whether the trees are isomorphic to each other or not? 

Solution: Two binary trees rootl and root 2 are isomorphic if they have the same structure. The values of the nodes 
does not affect whether two trees are isomorphic or not. In the diagram below, the tree in the middle is not 



int IsIsIsomorphic(struct TreeNode "rootl, struct TreeNode *root2){ 
if(!rootl && !root2) return 1; 
if((!rootl && root2) || (rootl && !root2)) 
return 0; 

return (IsIsomorphic(rootl->left, root2—deft) && IsIsomorphic(rootl—>right, root2—>right)); 

3 

Time Complexity: 0(n). Space Complexity: 0(n). 


Problem-43 Given two trees how do we check whether they are quasi-isomorphic to each other or not? 



children of some of the nodes of rootl. The data in the nodes are not important in determining quasi-isomorphism, 
only the shape is important. The trees below are quasi-isomorphic because if the children of the nodes on the left are 
swapped, the tree on the right is obtained. 

int QuasiIsomorphic(struct TreeNode "rootl, struct TreeNode *root2){ 
if(!rootl && !root2) return 1; 
if((!rootl && root2) || (rootl && !root2)) 
return 0; 

return (Quasiisomorphic(rootl~>left, root2—deft) && Quasilsomorphic(rootl —>right, root2-»right) 

|| Quasilsomorphic(rootl—>right, root2—deft) && Quasiisomorphic (rootl—deft, root2—►right)); 

} 

Time Complexity: 0(n). Space Complexity: 0(n). 
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Problem-44 A full k —ary tree is a tree where each node has either 0 or k children. Given an array which 
contains the preorder traversal of full k —ary tree, give an algorithm for constructing the full k —ary tree. 

Solution: In k —ary tree, for a node at i th position its children will be at k * i + 1 to k * i + k. For example, the below 



As we have seen, in preorder traversal first left subtree is processed then followed by root node and right subtree. 
Because of this, to construct a full k-ary, we just need to keep on creating the nodes without bothering about the 
previous constructed nodes. We can use this trick to build the tree recursively by using one global index. Declaration 
for fc-ary tree can be given as: 

struct K-aryTreeNode{ 
char data; 

struct K-aryTreeNode *child[]; 

}; 

int *Ind = 0; 

struct K-aryTreeNode *BuildK-aryTree(char A[], int n, int k){ 
if(n<=0) 

return NULL; 

struct K-aryTreeNode ‘newNode = (struct K-aryTreeNode*) malloc(sizeof(struct K-aryTreeNode)); 
if(!newNode) { 

printf(“Memory Error”); 
return; 

1 

newNode—»child = (struct K-aryTreeNode*) malloc( k * sizeof(struct K-aryTreeNode)); 
if(!newNode—>-child) { 

printf(“Memory Error”); 
return; 

} 

newNode—>data = A[Ind]; 
for (int i = 0; i<k; i++) { 

if(k * Ind + i <n) { 

Ind++; 

newNode—>child[i] = BuildK-aryTree(A, n, k,Ind ); 

} 

else newNode—»child[i] =NULL; 

1 

return newNode; 

} 

Time Complexity: O(n), where n is the size of the pre-order array. This is because we are moving sequentially and not 
visiting the already constructed nodes. 
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6.8 Threaded Binary Tree Traversals [Stack or Queue less Traversals] 

In parlipr sections we have seen that, preorder, inorder and postorder binary tree traversals used stacks and 
level order traversal used queues as an auxiliary data structure. In this section we will discuss new traversal 
algorithms which do not need both stacks and queues and such traversal algorithms are called 
threaded binary tree traversals or stack/queue less traversals. 

Issues with Regular Binary Tree Traversals 

• The storage space required for the stack and queue is large. 

• The majority of pointers in any binary tree are NULL. For example, a binary tree with n nodes has n + 1 



• It is difficult to find successor node (preorder, inorder and postorder successors) for a given node. 

Motivation for Threaded Binary Trees 

To solve these problems, one idea is to store some useful information in NULL pointers. If we observe previous 
traversals carefully, stack/queue is required because we have to record the current position in order to move to right 
subtree after processing the left subtree. If we store the useful information in NULL pointers, then we don’t have to 
store such information in stack/queue. The binary trees which store such information in NULL pointers are called 
threaded binary trees. From the above discussion, let us assume that we have decided to store some useful 
information in NULL pointers. The next question is what to store? 

The common convention is put predecessor/successor information. That means, if we are dealing with preorder 
traversals then for a given node, NULL left pointer will contain preorder predecessor information and NULL right 
pointer will contain preorder successor information. These special pointers are called threads. 

Classifying Threaded Binary Trees 

The classification is based on whether we are storing useful information in both NULL pointers or only in one of them. 

• If we store predecessor information in NULL left pointers only then we call such binary trees as 
left threaded binary trees. 

• If we store successor information in NULL right pointers only then we call such binary trees as 
right threaded binary trees. 

• If we store predecessor information in NULL left pointers only then we call such binary trees as 
fully threaded binary trees or simply threaded binary trees. 
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Note: For the remaining discussion we consider only (fully ) threaded binary trees. 


Types of Threaded Binary Trees 

Cased on above discussion we get three representations for threaded binary trees. 

• Preorder Threaded Binary Trees-. NULL left pointer will contain PreOrder predecessor information and 
NULL right pointer will contain PreOrder successor information 

• Inorder Threaded Binary Trees: NULL left pointer will contain InOrder predecessor information and 
NULL right pointer will contain InOrder successor information 

• Postorder Threaded Binary Trees: NULL left pointer will contain PostOrder predecessor information and 
NULL right pointer will contain PostOrder successor information 

Note: As the representations are similar, for the remaining discussion, we will use InOrder threaded binary trees. 


Threaded Binary Tree structure 

Any program examining the tree must be able to differentiate between a regular left /right pointer and a thread. To 
do this, we use two additional fields into each node giving us, for threaded trees, nodes of the following form: 


Left 

LTag 

data 

RTag 

Right 

_2s. 


struct ThreadedBinaryTreeNode{ 

struct ThreadedBinaryTreeNode ‘left; 
int LTag; 
int data; 
int RTag; 

struct ThreadedBinaryTreeNode ‘right; 


Difference between Binary Tree and Threaded Binary Tree Structures 



Regular Binary Trees 

Threaded Binary Trees 

if LTag == 0 

NULL 

left points to the in-order predecessor 

if LTag == 1 

left points to the left child 

left points to left child 

if RTag == 0 

NULL 

right points to the in-order successor 

if RTag == 1 

right points to the right child 

right points to the right child 


Note: Similarly, we can define for preorder/postorder differences as well. 

As an example, let us try representing a tree in inorder threaded binary tree form. The below tree shows how an 
inorder threaded binary tree will look like. The dotted arrows indicate the threads. If we observe, the left pointer of 
left most node (2) and right pointer of right most node (31) are hanging. 
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What should leftmost and rightmost pointers point to? 

In the representation of a threaded binary tree, it is convenient to use a special node Dummy which is always present 
even for an empty tree. Note that, right tag of dummy node is 1 and its right child points to itself. 


For Empty Tree 


For Normal Tree 



1 1 
i 
i 

/ 


With this convention the above tree can be represented as: 



Dummy Node 


Finding Inorder Successor in Inorder Threaded Binary Tree 

To find inorder successor of a given node without using a stack, assume that the node for which we want to find the 
inorder successor is P. 

Strategy: If P has a no right subtree, then return the right child of P. If P has right subtree, then return the left of the 
nearest node whose left subtree contains ?. 

struct ThreadedBinaryTreeNode* InorderSuccessor(struct ThreadedBinaryTreeNode *P){ 
struct ThreadedBinaryTreeNode ‘Position; 
if(P—►RTag == 0) return P->right; 
else { Position = P—>right; 

while(Position—►LTag == 1) 

Position = Position—deft; 
return Position; 

} 

} 

Time Complexity: O(n). Space Complexity: 0(1). 


Inorder Traversal in Inorder Threaded Binary Tree 

We can start with dummy node and call InorderSuccessor() to visit each node until we reach dummy node, 
void InorderTraversal(struct ThreadedBinaryTreeNode *root){ 

struct ThreadedBinaryTreeNode *P = InorderSuccessor(root); 
while(P != root) { 

P = InorderSuccessor(P); 

printf(“%d”,P—>data); ___ 
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} 

} 

Other way of coding: 

void InorderTraversal(stmct ThreadedBinaryTreeNode *root){ 
struct ThreadedBinaryTreeNode *P = root; 
while(l) { P = InorderSuccessor(P); 
if(P == root) return; 
printf(“%d”,P—»data); 

} 

) 

Time Complexity: O(n). Space Complexity: 0(1). 

Finding PreOrder Successor in InOrder Threaded Binary Tree 

Strategy: If P has a left subtree, then return the left child of P. If P has no left subtree, then return the right child of 
the nearest node whose right subtree contains P. 

struct ThreadedBinaryTreeNode* PreorderSuccessor(struct ThreadedBinaryTreeNode *P){ 
struct ThreadedBinaryTreeNode ‘Position; 
if(P—>LTag == 1) return P—>left; 
else { Position = P; 

while(Position—>RTag == 0) 

Position = Position—bright; 
return Position-Bright; 

] 

} 

Time Complexity: O(n). Space Complexity: 0(1). 

PreOrder Traversal of InOrder Threaded Binary Tree 

As similar to inorder traversal, start with dummy node and call PreorderSuccessorQ to visit each node until we get 
dummy node again. 

void PreorderTraversal(struct ThreadedBinaryTreeNode *root){ 
struct ThreadedBinaryTreeNode *P; 

P = PreorderSuccessor(root); 
while(P != root) { 

P = PreorderSuccessor(P); 
printf(“°/od”,P—>data); 

} 

1 

Other way of coding: 

void PreorderTraversal(struct ThreadedBinaryTreeNode ‘root) { 
struct ThreadedBinaryTreeNode *P = root; 
while(l){P = PreorderSuccessor(P); 
if(P == root) return; 
printf(“%d”, P—»data); 

} 

} 

Time Complexity: 0(n). Space Complexity: 0(1). 

Note: From the above discussion, it should be clear that inorder and preorder successor finding is easy with threaded 
binary trees. But finding postorder successor is very difficult if we do not use stack. 
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Insertion of Nodes in InOrder Threaded Binary Trees 

For simplicity, let us assume that there are two nodes P and Q and we want to attach Q to right of P . For this we will 
have two cases. 

• Node P does not has right child: In this case we just need to attach Q to P and change its left and right 



• Node P has right child (say, ft): In this case we need to traverse R’s left subtree and find the left most node 
and then update the left and right pointer of that node (as shown below). 



void InsertRightInInorderTBT(struct ThreadedBinaryTreeNode *P, struct ThreadedBinaryTreeNode *Q){ 
struct ThreadedBinaryTreeNode ‘Temp; 

Q.-^ right = P—bright; 

Q_—>RTag = P—>RTag; 

Q.—deft = P; 

Q->LTag = 0; 

P—>right = Q; 

P—f-RTag = 1; 

if(Q_—>RTag == 1) { //Case-2 

Temp = Q.-^ right; 
while(Temp—>LTag) 

Temp = Temp—deft; 

Temp—deft = Q; 

1 

) 

Time Complexity: 0(n). Space Complexity: 0(1). 
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Problems on Threaded binary Trees 

Problem-45 For a given binary tree (not threaded) how do we find the preorder successor? 

Solution: For solving this problem, we need to use an auxiliary stack S. On the first call, the parameter node is a 
pointer to the head of the tree, thereafter its value is NULL. Since we are simply asking for the successor of the node 
we got last time we called the function. It is necessary that the contents of the stack S and the pointer P to the last 
node “visited” are preserved from one call of the function to the next, they are defined as static variables. 

// pre-order successor for an unthreaded binary tree 
struct BinaryTreeNode *PreorderSuccssor(struct BinaryTreeNode *node){ 
static struct BinaryTreeNode *P; 
static Stack *S = CreateStackQ; 
if(node != NULL) 

P = node; 

if(P—deft != NULL) { 

Push(S,P); 

P = P—deft; 

3 

else { while (P—>right = NULL) 

P = Pop(S); 

P = P—>right; 

} 

return P; 

) 

Problem-46 For a given binary tree (not threaded) how do we find the inorder successor? 

Solution: Similar to above discussion, we can find the inorder successor of a node as: 

// In-order successor for an unthreaded binary tree 
struct BinaryTreeNode *InorderSuccssor(struct BinaryTreeNode *node)( 
static struct BinaryTreeNode *P; 
static Stack *S = CreateStackQ; 
if(node != NULL) 

P = node; 

if(P—> right == NULL) 

P = Pop(S); 
else { P = P—> right; 

while (P—deft != NULL) 

Push(S, P); 

P = P—deft; 

} 

return P; 

} 

6.9 Expression Trees 

A tree representing an expression is called as an expression tree. In expression trees leaf nodes are operands and non¬ 
leaf nodes are operators. That means, an expression tree is a binary tree where internal nodes are operators and leaves 
are operands. Expression tree consists of binary expression. But for a unary operator, one subtree will be empty. Below 
figure shows a simple expression tree for (A + B * C) / D. 
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Algorithm for Building Expression Tree from Postfix Expression 

struct BinaryTreeNode *BuildExprTree(char postfixExpr[], int size){ 
struct Stack *S = Stack(size); 
for (int i = 0; i< size; i++) { 

if(postfixExpr[i] is an operand) { 

struct BinaryTreeNode newNode = (struct BinaryTreeNode*) 
malloc( sizeof (struct BinaryTreeNode)); 
if(!newNode) { 

printf(“Memory Error”); 
return; 

} 

newNode—>-data =postfixExpr[i]; 
newNode—>left = newNode—bright = NULL; 

Push(S, newNode); 

) 

else { 

struct BinaryTreeNode *T2 = Pop(S), *T1 = Pop(S); 
struct BinaryTreeNode newNode = (struct BinaryTreeNode*) 
malloc(sizeof(struct BinaryTreeNode)); 
if(!newNode) { 

printf(“Memory Error”); return; 

} 

newNode—>data = postfixExpr[i]; 
newNode—fieft = Tl; newNode^right = T2; 

Push(S, newNode); 

} 

3 

return S; 

} 

Example: Assume that one symbol is read at a time. If the symbol is an operand, we create a tree node and push a 
pointer to it onto a stack. If the symbol is an operator, pop pointers to two trees 71 and Ti from the stack (71 is popped 
first) and form a new tree whose root is the operator and whose left and right children point to Ti and 71 respectively. 
A pointer to this new tree is then pushed onto the stack. 

As an example, assume the input is A B C * + D /. The first three symbols are operands, so create tree nodes and push 
pointers to them onto a stack as shown below. 
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Next, an operator is read, so two pointers to trees are popped, a new tree is formed and a pointer to it is pushed onto 
the stack. 



Next, an operator V is read, so two pointers to trees are popped, a new tree is formed and a pointer to it is pushed onto 



Finally, the last symbol (7’) is read, two trees are merged and a pointer to the final tree is left on the stack. 
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6.10 XOR Trees 



This concept is very much similar to memory efficient doubly linked lists of Linked Lists chapter. Also, like 
threaded binary trees this representation does not need stacks or queues for traversing the trees. This representation is 
used for traversing hack (to parent) and forth (to children) using © operation. To represent the same in XOR trees, for 
each node below are the rules used for representation: 

• Each nodes left will have the © of its parent and its left children. 

• Each nodes right will have the © of its parent and its right children. 

• The root nodes parent is NULL and also leaf nodes children are NULL nodes. 

Based on the above rules and discussion the tree can be represented as: 



The major objective of this presentation is ability to move to parent as well to children. Now, let us see how to use this 
representation for traversing the tree. For example, if we are at node B and want to move to its parent node A, then we 
just need to perform © on its left content with its left child address (we can use right child also for going to parent 
node). 

Similarly, if we want to move to its child (say, left child D) then we have to perform © on its left content with its 
parent node address. One important point that we need to understand about this representation is: When we are at 
node B how do we know the address of its children D? Since the traversal starts at node root node, we can apply © on 
roots left content with NULL. As a result we get its left child, B. When we are at B, we can apply © on its left content 
with A address. 
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6.11 Binary Search Trees (BSTs) 

Why Binary Search Trees? 

In previous sections we have discussed different tree representations and in all of them we did not impose any 
restriction on the nodes data. As a result, to search for an element we need to check both in left subtree and also right 
subtree. Due to this, the worst case complexity of search operation is O(n). 

In this section, we will discuss another variant of binary trees: Binary Search Trees (BSTs). As the name suggests, the 
main use of this representation is for searching. In this representation we impose restriction on the kind of data a 
node can contain. As a result, it reduces the worst case average search operation to O (logn). 

Binary Search Tree Property 

In binary search trees, all the left subtree elements should be less than root data and all the right subtree elements 
should be greater than root data. This is called binary search tree property. Note that, this property should be satisfied 
at every node in the tree. 

• The left subtree of a node contains only nodes with keys less than the nodes key. 

• The right subtree of a node contains only nodes with keys greater than the nodes key. 

• Both the left and right subtrees must also be binary search trees. 



Example: The left tree is a binary search tree and right tree is not binary search tree (at node 6 it’s not satisfying the 
binary search tree property). 



Binary Search Tree Declaration 

There is no difference between regular binary tree declaration and binary search tree declaration. The difference is 
only in data but not in structure. But for our convenience we change the structure name as: 

struct BinarySearchTreeNode{ 
int data; 

struct BinarySearchTreeNode ’left; 
struct BinarySearchTreeNode ’right; 
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Operations on Binary Search Trees 

Main operations: The main operations that were supported by binary search trees are: 

• Find/ Find Min im u m / Find Maximum element in binary search trees 

• Inserting an element in binary search trees 

• Deleting an element from binary search trees 

Auxiliary operations: Checking whether the given tree is a binary search tree or not 

• Finding fc th -smallest element in tree 

• Sorting the elements of binary search tree and many more 

Important Notes on Binary Search Trees 

• Since root data is always in between left subtree data and right subtree data, performing inorder traversal on 
binary search tree produces a sorted list. 

• While solving problems on binary search trees, most of the time, first we process left subtree, process root 
data and then process right subtree. That means, depending on the problem only the intermediate step 
(processing root data) changes and we will not touch first and third steps. 

• If we are searching for an element and if the left subtree roots data is less than the element we want to search 
then skip it. Same is the case with right subtree as well. Because of this binary search trees takes less time for 
searching an element than regular binary trees. In other words, the binary search trees consider only either 
left or right subtrees for searching an element but not both. 

Finding an Element in Binary Search Trees 

Find operation is straightforward in a BST. Start with the root and keep moving left or right using the BST property. If 
the data we are searching is same as nodes data then we return current node. If the data we are searching is less than 
nodes data then search left subtree of current node otherwise search in right subtree of current node. If the data is not 
present, we end up in a NULL link. 

struct BinarySearchTreeNode *Find(struct BinarySearchTreeNode “root, int data){ 
if( root == NULL) 
return NULL; 
if( data < root—+data) 

return Find(root—deft, data); 
else if( data > root—>data) 

retum( Find( root—»right, data); 
return root; 

] 

Time Complexity: O(n), in worst case (when BST is a skew tree). Space Complexity: O(n), for recursive stack. 

Non recursive version of the above algorithm can be given as: 
struct BinarySearchTreeNode *Find(struct BinarySearchTreeNode ‘root, int data){ 
if( root == NULL) return NULL; 
while (root) { 

if(data == root—»data) 
return root; 

else if (data > root—>data) 

root = root-bright; 
else root = root—deft; 

} 

return NULL; 

3 
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Time Complexity: O(n). Space Complexity: 0(1). 


Finding Minimum Element in Binary Search Trees 

In BSTs, the minimum element is the left most node which does not has left child. In the below BST, the minimum 
element is 4. 


struct BinarySearchTreeNode *FindMin(struct BinarySearchTreeNode *root){ 
if(root == NULL) return NULL; 
else if( root—deft == NULL) 


} 


else 


return root; 

return FindMin( root—deft ); 



Time Complexity: 0(n), in worst case (when BST is a left skew tree). Space Complexity: 0(n), for recursive stack. 

Non recursive version of the above algorithm can be given as: 
struct BinarySearchTreeNode *FindMin(struct BinarySearchTreeNode * root) { 
if( root == NULL) 
return NULL; 

while (root—deft != NULL) 
root = root—deft; 
return root; 

} 

Time Complexity: 0(n). Space Complexity: 0(1). 

Finding Maximum Element in Binary Search Trees 

In BSTs, the maximum element is the right most node which does not has right child. In the below BST, the maximum 
element is 16. 

struct BinarySearchTreeNode *FindMax(struct BinarySearchTreeNode ‘root) { 
if(root == NULL) 

return NULL; 

else if( root—night == NULL ) 
return root; 

else return FindMax( root—night); 

} 

Time Complexity: 0(n), in worst case (when BST is a right skew tree). Space Complexity: 0(n), for recursive stack. 
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Non recursive version of the above algorithm can be given as: 


struct BinarySearchTreeNode *FindMax(struct BinarySearchTreeNode * root) { 
if( root == NULL) 

return NULL; 

while( root-bright != NULL) 
root = root-bright; 
return root; 

1 

Time Complexity: O(n). Space Complexity: 0(1). 

Where is Inorder Predecessor and Successor? 


Where is the inorder predecessor and successor of a node X in a binary search tree assuming all keys are distinct? 

If X has two children then its inorder predecessor is the maximum value in its left subtree and its inorder successor the 
minimum value in its right subtree. 


Predecessor(X) 


Successor® 


If it does not have a left child a nodes inorder predecessor is its first left ancestor. 
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Inserting an Element from Binary Search Tree 



To insert data into binary search tree, first we need to find the location for that element. We can find the location of 
insertion by following the same mechanism as that of find operation. While finding the location if the data is already 
there then we can simply neglect and come out. Otherwise, insert data at the last location on the path traversed. As 
an example let us consider the following tree. The dotted node indicates the element (5) to be inserted. To insert 5, 
traverse the tree as using find function. At node with key 4, we need to go right, but there is no subtree, so 5 is not in 
the tree, and this is the correct location for insertion. 

struct BinarySearchTreeNode *Insert(struct BinarySearchTreeNode ‘root, int data) { 
if( root == NULL ) { 

root = (struct BinarySearchTreeNode *) malloc(sizeof(struct BinarySearchTreeNode)); 
if( root == NULL ) { 

printf(“Memory Error”); 
return; 

} 

else { 

root—>data = data; 

root—>-left = root—>right = NULL; 

} 

} 

else { if( data < root—>data) 

root—deft = Insert(root—deft, data); 
else if( data > root—»data) 

root—aright = Insert(root—>right, data); 

} 

return root; 

} 

Note: In the above code, after inserting an element in subtrees the tree is returned to its parent. As a result, the 
complete tree will get updated. 

Time Complexity:0(n). Space Complexity:0(n), for recursive stack. For iterative version, space complexity is 0(1). 

Deleting an Element from Binary Search Tree 

The delete operation is little complicated than other operations. This is because the element to be deleted may not be 
the leaf node. In this operation also, first we need to find the location of the element which we want to delete. Once 
we have found the node to be deleted, consider the following cases: 

• If the element to be deleted is a leaf node: return NULL to its parent. That means make the corresponding 
child pointer NULL. In the below tree to delete 5, set NULL to its parent node 2. 
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If the element to be deleted has one child: In this case we just need to send the current nodes child to its 
parent. In the below tree, to delete 4, 4 left subtree is set to its parent node 2. 



• If the element to be deleted has both children: The general strategy is to replace the key of this node with the 
largest element of the left subtree and recursively delete that node (which is now empty). The largest node in 
the left subtree cannot have a right child, the second delete is an easy one. As an example, let us consider the 
following tree. In the below tree, to delete 8, it is the right child of root. The key value is 8. It is replaced with 
the largest key in its left subtree (7), and then that node is deleted as before (second case). 



struct BinarySearchTreeNode *Delete(struct BinarySearchTreeNode ‘root, int data) { 
struct BinarySearchTreeNode ‘temp; 
if( root == NULL) 

printf("Element not there in tree"); 
else if(data < root—^element) 

root—deft = Delete (root—deft, data); 
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else if(data > root—►element) 

root—>right = Delete(root—►right, data); 
else { //Found element 

if( root—deft && root—>right) { 

/* Replace with largest in left subtree */ 

temp = FindMax( root—deft); 

root—»data = temp—►element; 

root—deft = Delete(root—deft, root—►data); 

} 

else { /* One child 7 

temp = root; 
if( root—deft == NULL ) 

root = root—>right; 
if( root—>right == NULL) 
root = root—deft; 
free( temp); 

} 

} 

return root; 

} 

Time Complexity: O(n). Space Complexity; O(n) for recursive stack. For iterative version, space complexity is 0(1). 


Problems on Binary Search Trees 


Problem-46 Given pointers to two nodes in a binary search tree, find lowest common ancestor ( LCA ). Assume 
that both values already exist in the tree. 


Solution: 



The main idea of the solution is: while traversing BST from root to bottom, the first node we encounter with value 
between a and /?, i.e., a < node -> data < (3 is the Least Common Ancestor(LCA) of a and (3 (where a < (3). So 
just traverse the BST in pre-order, if we find a node with value in between a and (3 then that node is the LCA. If its 
value is greater than both a and (3 then LCA lies on left side of the node and if its value is smaller than both a and (3 
then LCA lies on right side. 


struct BinarySearchTreeNode 
while(l) { 


*FindLCA(struct BinarySearchTreeNode ‘root, 

struct BinarySearchTreeNode *a, struct BinarySearchTreeNode * (3) { 


if((a—>data < root—>-data && (3—>data > root—>data) || 
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(a—>data > root—xlata && (3—>data < root—>data)) 
return root; 

if(a—>data < root—>data) 
root = root—deft; 
else root = root—aright; 

} 

} 

T im e complexity: 0(n). Space complexity: O(n), for skew trees. 

Problem-47 Give an algorithm for finding the shortest path between two nodes in a BST. 

Solution: It’s nothing but finding the LCA of two nodes in BST. 

Problem-48 Give an algorithm for counting the number of BSTs possible with n nodes. 

Solution: This is a DP problem and refer Dynamic Programming chapter for algorithm. 

Problem-49 Give an algorithm to check whether the given binary tree is a BST or not. 

Solution: Consider the following simple program. For each node, check if left node of it is smaller than the node and 
right node of it is greater than the node. This approach is wrong as this will return true for below binary tree. 
Checking only at current node is not enough. 



int IsBST(struct BinaryTreeNode* root) { 
if(root == NULL) return 1; 

/* false if left is > than root 7 

if(root—deft != NULL && root—deft—-data > root—>data) 
return 0; 

/* false if right is < than root 7 

if(root—>right != NULL && root—>right—>data < root—>data) 
return 0; 

/* false if, recursively, the left or right is not a BST 7 
if(!IsBST(root—deft) || !IsBST(root-+right)) 
return 0; 

/* passing all that, it's a BST 7 
return 1; 

3 

Problem-50 Can we think of getting the correct algorithm? 

Solution- For each node, check if max value in left subtree is smaller than the current node data and min value in right 
subtree greater than the node data. It is assumed that we have helper functions FindMinQ and FindMaxQ that return 
the min or max integer value from a non-empty tree. 

/* Returns true if a binary tree is a binary search tree 7 
int IsBST(struct BinaryTreeNode* root) [ 
if(root == NULL) return 1; 
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/* false if the max of the left is > than root V 
iffroot—deft != NULL && FindMax(root—deft) > root—>data) 
return 0; 

/* false if the min of the right is <= than root 7 
if(root—>right != NULL && FindMin(root—>right) < root—>data) 
return 0; 

/* false if, recursively, the left or right is not a BST 7 
if(!IsBST(root—deft) || !IsBST(root—>right)) return 0; 

/* passing all that, it's a BST */ 
return 1; 

} 

Time complexity: O(n 2 ). Space Complexity: O(n). 

Problem-51 Can we improve the complexity of Problem-50? 

Solution: Yes. A better solution looks at each node only once. The trick is to write a utility helper function 
IsBSTUtil(stract BinaryTreeNode* root, int min, int max) that traverses down the tree keeping track of the narrowing 
min and max allowed values as it goes, looking at each node only once. The initial values for min and max should he 
INT_MIN and INT.MAX — they narrow from there. 

Initial call: IsBST(root, INT_MIN, INTJVtAX); 
int IsBST(struct BinaryTreeNode "root, int min, int max) { 
if(!root) return 1; 

return (root—>data >min && root—>data < max && 

IsBSTUtil(root—deft, min, root—>data) && IsBSTUtil(root—>right, root—>data, max)); 

} 

Time Complexity: O(n). Space Complexity: 0(n), for stack space. 

Problem-52 Can we further improve the complexity of Problem-50? 

Solution: Yes, using inorder traversal. The idea behind this solution is that, inorder traversal of BST produces sorted 
lists. While traversing the BST in inorder, at each node check the condition that its key value should be greater than 
the key value of its previous visited node. Also, we need to initialize the prev with possible minimum integer value 
(say, INT_MIN). 

int prev = INT_MIN; 

int IsBST(struct BinaryTreeNode ‘root, int *prev) { 
if(!root) return 1; 
if(!IsBST(root—deft, prev)) 
return 0; 

if(root—>data < *prev) 
return 0; 

‘prev = root—»data; 

return IsBST(root—»right, prev); 

} 

Time Complexity: O(n). Space Complexity: O(n), for stack space. 

Problem-53 Give an algorithm for converting BST to circular DLL with space complexity 0(1). 

Solution: Convert left and right subtrees to DLLs and maintain end of those lists. Then, adjust the pointers. 

struct BinarySearchTreeNode *BST2DLL(struct BinarySearchTreeNode ‘root, struct BinarySearchTreeNode “Ltail) { 
struct BinarySearchTreeNode left, ‘ltail, ‘right, ‘rtail; 
if(!root) { 

* ltail = NULL; 
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return NULL; 

} 

left = BST2DLL(root—deft, &ltail); 
right = BST2DLL(root—>right, &rtail); 
root—deft = ltail; 
root—bright = right; 


if(!right) 


* ltail = root; 

else { 

right—deft = root; 

1 

if(!left) 

* ltail = rtail; 


return root; 

else ( 

ltail—»right = root; 
return left; 


Time Complexity: O(n). 

Problem-54 For Problem-53, is there any other way of solving? 

Solution: Yes. There is an alternative solution based on divide and conquer method which is quite neat. 

struct BinarySearchTreeNode *Append(struct BinarySearchTreeNode *a, struct BinarySearchTreeNode *b) { 
struct BinarySearchTreeNode *aLast, *bLast; 
if (a==NULL) 
return b; 
if (b==NULL) 
return a; 
aLast = a->left; 
bLast = b->left; 
aLast->right = b; 
b->left = aLast; 
bLast->right = a; 
a->left = bLast; 
return a; 

} 

struct BinarySearchTreeNode* TreeToList(struct BinarySearchTreeNode *root) { 
struct BinarySearchTreeNode *aList, *bList; 
if (root==NULL) 

return NULL; 

aList = TreeToList(root->left); 

bList = TreeToList(root->right); 

root->left = root; 

root->right = root; 

aList = Append(aList, root); 

aList = Append(aList, bList); 

return(aList); 

} 

Time Complexity: O(n). 

Problem-55 Given a sorted doubly linked list, give an algorithm for converting it to balanced binary search tree. 
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Solution: Find the middle node and adjust the pointers, 
struct DLLNode * DLLtoBalancedBST(struct DLLNode ’head) { 
struct DLLNode ’temp, *p, *q; 
if( !head || !head—>next) 
return head; 

temp = FindMiddleNode(head); 
p = head; 

while(p—mext != temp) 
p = p—>next; 
p—>next - NULL; 
q = temp—>next; 
temp—>next = NULL; 
temp—>prev = DLLtoBalancedBST(head); 
temp—»next = DLLtoBalancedBST(q); 
return temp; 

} 

Time Complexity: 2T(n/2 ) + O(n) [for finding the middle node] = 0(nlogn). 

Note: For FindMiddleNode function refer Linked Lists chapter. 

Problem-56 Given a sorted array, give an algorithm for converting the array to BST. 

Solution: If we have to choose an array element to be the root of a balanced BST, which element we should pick? The 
root of a balanced BST should be the middle element from the sorted array. We would pick the middle element from 
the sorted array in each iteration. We then create a node in the tree initialized with this element. After the element is 
chosen, what is left? Could you identify the sub-problems within the problem? 

There are two arrays left — The one on its left and the one on its right. These two arrays are the sub-problems of the 
original problem, since both of them are sorted. Furthermore, they are subtrees of the current node’s left and right 
child. 

The code below creates a balanced BST from the sorted array in O(n) time (n is the number of elements in the array). 
Compare how similar the code is to a binary search algorithm. Both are using the divide and conquer methodology, 
struct BinaryTreeNode *BuildBST(int A[], int left, int right) [ 
struct BinaryTreeNode ’newNode; 
int mid; 
if(left > right) 

return NULL; 

newNode = (struct BinaryTreeNode *)malloc(sizeof(struct BinaryTreeNode)); 
if(!newNode) { 

printf(“Memory Error”); 
return; 

} 

if(left == right) { 

newNode—>data = A [left]; 

newNode—deft = newNode—>right = NULL; 

1 

else [ mid = left + (right-left)/ 2; 
newNode-»data = A[mid]; 
newNode—deft = BuildBST(A, left, mid - 1); 
newNode—>right = BuildBST(A, mid + 1, right); 

} 

return newNode; 
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} 

Time Complexity: O(n). Space Complexity: O(n), for stack space. 

Problem-57 Given a singly linked list where elements are sorted in ascending order, convert it to a height 
balanced BST. 

Solution: A naive way is to apply the Problem-55 solution directly. In each recursive call, we would have to traverse 
half of the list’s length to find the middle element. The run time complexity is clearly O (nlogri), where n is the total 
number of elements in the list. This is because each level of recursive call requires a total of n/2 traversal steps in the 
list, and there are a total of logn number of levels (ie, the height of the balanced tree). 

Problem-58 For Problem-57, can we improve the complexity? 

Solution: Hint: How about inserting nodes following the list’s order? If we can achieve this, we no longer need to find 
the middle element, as we are able to traverse the list while inserting nodes to the tree. 

Best Solution: As usual, the best solution requires us to think from another perspective. In other words, we no longer 
create nodes in the tree using the top-down approach. Create nodes bottom-up, and assign them to its parents. The 
bottom-up approach enables us to access the list in its order while creating nodes [42], 

Isn’t the bottom-up approach neat? Each time we are stucked with the top-down approach, give bottom-up a try. 
Although bottom-up approach is not the most natural way we think, it is extremely helpful in some cases. However, 
we should prefer top-down instead of bottom-up in general, since the latter is more difficult to verify in correctness. 

Below is the code for converting a singly linked list to a balanced BST. Please note that the algorithm requires the list’s 
length to be passed in as the function’s parameters. The list’s length could be found in O(n) time by traversing the 
entire list’s once. The recursive calls traverse the list and create tree’s nodes by the list’s order, which also takes O(n) 
time. Therefore, the overall run time complexity is still O(n). 

struct BinaryTreeNode* SortedListToBST(struct ListNode *& list, int start, int end) { 
if(start > end) 

return NULL; 

// same as (start+end)/2, avoids overflow 
int mid = start + (end - start) / 2; 

struct BinaryTreeNode TeftChild = SortedListToBST(list, start, mid-1); 
struct BinaryTreeNode * parent; 

parent = (struct BinaryTreeNode *)malloc(sizeof(struct BinaryTreeNode)); 
if(!parent) { 

printf(“Memory Error”); 
return; 

} 

parent—>data=list—>data; 
parent—deft = leftChild; 
list = list—>next; 

parent—bright = SortedListToBST(list, mid+1, end); 
return parent; 

} 

struct BinaryTreeNode * SortedListToBST(struct ListNode "head, int n) { 
return SortedListToBST(head, 0, n-1); 

} 

Problem-59 Give an algorithm for finding the k th smallest element in BST. 

Solution: The idea behind this solution is that, inorder traversal of BST produces sorted lists. While traversing the BST 
in inorder, keep track of the number of elements visited. 


6.11 Binary Search Trees (BSTs) 


158 


Data Structures and Algorithms Made Easy 


Trees 


struct BinarySearchTreeNode ‘kthSmallestlnBST(struct BinarySearchTreeNode ‘root, int k, int *count){ 
if(!root) return NULL; 

struct BinarySearchTreeNode ‘left = kthSmaUestInBST(root—deft, k, count); 
if( left) return left; 
if(++count == k) 

return root; 

return kthSmallestInBST(root—>right, k, count); 

] 

Time Complexity: O(n). Space Complexity: 0(1). 

Problem-60 Floor and ceiling: If a given key is less than the key at the root of a BST then floor of key (the largest 
key in the BST less than or equal to key) must be in the left subtree. If key is greater than the key at the root then 
floor of key could be in the right subtree, but only if there is a key smaller than or equal to key in the right subtree; 
if not (or if key is equal to the key at the root) then the key at the root is the floor of key. Finding the ceiling is 
similar with interchanging right and left.For example, if the sorted with input array is [1, 2, 8, 10, 10, 12,19], then 
For x = 0: floor doesn't exist in array, ceil = 1, For x = 1: floor =1, ceil =1 
For x = 5: floor =2, ceil = 8, Forr = 20: floor =19, ceil doesn't exist in array 

Solution: The idea behind this solution is that, inorder traversal of BST produces sorted lists. While traversing the BST 
in inorder, keep track of the values being visited. If the roots data is greater than the given value then return the 
previous value which we have maintained during traversal. If the roots data is equal to the given data then return root 
data. 

struct BinaryTreeNode ‘FloorlnBST(struct BinaryTreeNode ‘root, int data){ 
struct BinaryTreeNode *prev=NULL; 
return FloorInBSTUtil(root, prev, data); 

} 

struct BinaryTreeNode *FloorInBSTUtil(struct BinaryTreeNode ‘root, struct BinaryTreeNode ‘prev, int data){ 
if(!root) 

return NULL; 

if(!FloorInBSTUtil(root—deft, prev, data)) 
return 0; 

if(root—»data == data) 
return root; 
if(root—>data > data) 
return prev; 
prev = root; 

return FloorInBSTUtil(root—>right, prev, data); 

} 

Time Complexity: O(n). Space Complexity: O(n), for stack space. 

For ceiling, we just need to call the right subtree first and then followed by left subtree. 

struct BinaryTreeNode *CeilingInBST(struct BinaryTreeNode ‘root, int data){ 
struct BinaryTreeNode *prev=NULL; 
return CeilingInBSTUtil(root, prev, data); 

} 

struct BinaryTreeNode *CeilingInBSTUtil(struct BinaryTreeNode ‘root, struct BinaryTreeNode ‘prev, int data){ 
if(!root) 

return NULL; 

if(!CeilingInBSTUtil(root—>right, prev, data)) 
return 0; 

if(root—>data == data) return root; 
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if(root—>data < data) return prev; 
prev = root; 

return CeilingInBSTUtil(root—>left, prev, data); 

} 

Time Complexity: O(n). Space Complexity: 0(n), for stack space. 

Problem-61 Give an algorithm for finding the union and intersection of BSTs. Assume parent pointers are 
available (say threaded binary trees). Also, assume the lengths of two BSTs are m and n respectively. 

Solution: If parent pointers are available then the problem is same as merging of two sorted lists. This is because if we 
call inorder successor each time we get the next highest element. It’s just a matter of which InorderSuccessor to call. 

Time Complexity: 0(m + n). Space complexity: 0(1). 

Problem-62 For Problem-61, what if parent pointers are not available? 

Solution: If parent pointers are not available then, one possibility is converting the BSTs to linked lists and then 
merging. 

1 Convert both the BSTs into sorted doubly linked lists in 0(n + m) time. This produces 2 sorted lists. 

2 Merge the two double linked lists into one and also maintain the count of total elements in 0(n + m) time. 

3 Convert the sorted doubly linked list into height balanced tree in O (n + m) time. 

Problem-63 For Problem-61, is there any alternative way of solving the problem? 

Solution: Yes, using inorder traversal. 

• Perform inorder traversal on one of the BST. 

• While performing the traversal store them in table (hash table). 

• After completion of the traversal of first BST, start traversal of the second BST and compare them with hash 
table contents. 

Time Complexity: 0(m + n). Space Complexity: 0(Max(m, n)). 

Problem-64 Given a BST and two numbers Kl and K2, give an algorithm for printing all the elements of BST in 
the range K 1 and K2. 

Solution: 

void RangePrinter(struct BinarySearchTreeNode ‘root, int Kl, int K2) { 
if(root == NULL) 
return; 

if(root—>data >= Kl) 

RangePrinter(root—i-left, Kl, K2); 
if(root—>data >= Kl && root—>data <= K2) 
printf(“%d”, root—>data); 
if(root—>data <= K2) 

RangePrinter(root—>right, Kl, K2); 

1 

Time Complexity: O(n). Space Complexity: O(n), for stack space. 

Problem-65 For Problem-64, is there any alternative way of solving the problem? 

Solution: We can use level order traversal: while adding the elements to queue check for the range, 
void RangeSeachLevelOrder(struct BinarySearchTreeNode ‘root, int Kl, int K2){ 
struct BinarySearchTreeNode ‘temp; 
struct Queue *Q_= CreateQueue(); 
if(!root) 

return NULL; 

Q= EnQueue(Q, root); 
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while(!IsEmptyQueue(Q)) { 
temp=DeQueue(Q); 

if(temp—>data >= K1 && temp—>data <= K2) 
printf(“%d”,temp—>data); 
if(temp—deft && temp—>data >= Kl) 

EnQueue(Q, temp—deft); 
if(temp—> right && temp—»data <= K2) 

EnQueue(Q, temp-»right); 

} 

DeleteQueue(Q); 
return NULL; 

3 

Time Complexity: O(n). Space Complexity: O(n), for queue. 

Problem-66 For Problem-64, can we still think of alternative way for solving the problem? 

Solution: First locate Kl with normal binary search and after that use InOrder successor until we encounter K2. For 
algorithm, refer problems section of threaded binary trees. 

Problem-67 Given root of a Binary Search tree, trim the tree, so that all elements in the new tree returned are 
between the inputs A and B. 

Solution: It’s just another way of asking the Problem-64. 

Problem-68 Given two BSTs, check whether the elements of them are same or not. For example: two BSTs with 
data 10 5 20 15 30 and 10 20 15 30 5 should return true and the dataset with 10 5 20 15 30 
and 10 15 30 20 5 should return false. Note: BSTs data can be in any order. 

Solution: One simple way is performing a traversal on first tree and storing its data in hash table. As a second step 
perform traversal on second tree and check whether that data is already there in hash table or not. During the 
traversal of second tree if we find any mismatch return false. 

Time Complexity: O (max(m,ri)), where m and n are the number of elements in first and second BST. Space 
Complexity: 0(max(m, n)). This depends on the size of the first tree. 

Problem-69 For Problem-68, can we reduce the time complexity? 

Solution: Instead of performing the traversals one after the other, we can perform in — order traversal of both the 
trees in parallel. Since the in — order traversal gives the sorted list, we can check whether both the trees are 
generating the same sequence or not. 

Time Complexity: O (max(m,ri)). Space Complexity: 0(1). This depends on the size of the first tree. 

Problem-70 For the key values 1... n, how many structurally unique BSTs are possible that store those keys. 

Solution: Strategy: consider that each value could be the root. Recursively find the size of the left and right subtrees, 
int CountTrees(int n) { 

if(n<=l) return 1; 

else { // there will be one value at the root, with whatever remains on the left and right 

// each forming their own subtrees.Iterate through all the values that could be the root... 

int sum = 0; 

int left, right, root; 

for (root=l; root<=n; root++) { 

left = CountTrees(root - 1); 

right = CountTrees(numKeys - root); 

// number of possible trees with this root == left'right 
sum += left*right; 
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return(sum); 

} 

} 

6.12 Balanced Binary Search Trees 

In earlier sections we have seen different trees whose worst case complexity is O(n), where n is the number of nodes 
in the tree. This happens when the trees are skew trees. In this section we will try to reduce this worst case complexity 
to O (logri) by imposing restrictions on the heights. In general, the height balanced trees are represented with HB(k), 
where k is the difference between left subtree height and right subtree height. Sometimes k is called balance factor. 

Complete Balanced Binary Search Trees 

In HB(k), if k = 0 (if balance factor is zero), then we call such binary search trees as full balanced binary search 
trees. That means, in HB(0) binary search tree, the difference between left subtree height and right subtree height 
should be at most zero. This ensures that the tree is a full binary tree. For example, 



Note: For constructing HB{ 0) tree refer problems section. 

6.13 AVL (Adelson-Velskii and Landis) Trees 

In HB(k), if k = 1 (if balance factor is one), such binary search tree is called an AVL tree. That means an AVL tree is 
a binary search tree with a balance condition: the difference between left subtree height and right subtree height is at 
most 1. 

Properties of AVL Trees 

A binary tree is said to be an AVL tree, if: 

• It is a binary search tree, and 

• For any node X, the height of left subtree of X and height of right subtree of X differ by at most 1. 



As an example among the above binary search trees, the left one is not an AVL tree, whereas the right binary search 
tree is an AVL tree. 
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Minimum/Maximum Number of Nodes in AVL Tree 

For simplicity let us assume that the height of an AVL tree is h and NQi) indicates the number of nodes in AVL tree 
with height h. To get minimum number of nodes with height h, we should fill the tree with as minimum nodes as 
possible. That means if we fill the left subtree with height h—1 then we should fill the right subtree with height 
h — 2. As a result, the minimum number of nodes with height h is: 

N(h) = N(h - 1) + N(h - 2) + 1 

In the above equation: 

• fV(/i — 1) indicates the minimum number of nodes with height h — 1. 

• N(h — 2) indicates the minimum number of nodes with height h — 2. 

• In the above expression, “1” indicates the current node. 

We can give N(h — 1) either for left subtree or right subtree. Solving the above recurrence gives: 

N(h) =0(1.618 h ) => h = 1.44logn «0 (logn) 



Where n is the number of nodes in AVL tree. Also, the above derivation says that the maximum height in AVL trees is 
O(logn ). Similarly, to get maximum number of nodes, we need to fill both left and right subtrees with height h — 
1. As a result, we get 

N(h) = N(h - 1) + N(h - 1) + 1 = 2N(h - 1) + 1 
The above expression defines the case of full binary tree. Solving the recurrence we get: 

N(h) =0(2 ft ) h = logn «0 (logri) 

••• In both the cases, AVL tree property is ensuring that the height of an AVL tree with n nodes is O (logn). 

AVL Tree Declaration 

Since AVL tree is a BST, the declaration of AVL is similar to that of BST. But just to simplify the operations, we 
include the height also as part of declaration, 
struct AVLTreeNode{ 

struct AVLTreeNode ’left; 
int data; 

struct AVLTreeNode ‘right; 
int height; 

}; 

Finding Height of an AVL tree 

int Height(struct AVLTreeNode ‘root){ 
if( !root) return-1; 
else return root—^height; 

} 

Time Complexity: 0(1). 

Rotations 

When the tree structure changes (e.g., with insertion or deletion), we need to modify the tree to restore the AVL tree 
property. This can be done using single rotations or double rotations. Since an insertion/deletion involves 
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adding/deleting a single node, this can only increase/decrease the height of some subtree by 1. So, if the AVL tree 
property is violated at a node X, it means that the heights of left(A) and right(A) differ by exactly 2. This is because, if 
we balance the AVL tree every time, then at any point, the difference in heights of left(A) and right(A) differ by 
exactly 2. Rotations is the technique used for restoring the AVL tree property. That means, we need to apply the 
rotations for the node X. 

Observation: One important observation is that, after an insertion, only nodes that are on the path from the insertion 
point to the root might have their balances altered because only those nodes have their subtrees altered. To restore 
the AVL tree property, we start at the insertion point and keep going to root of the tree. While moving to root, we 
need to consider the first node whichever is not satisfying the AVL property. From that node onwards every node on 
the path to root will have the issue. Also, if we fix the issue for that first node, then all other nodes on the path to root 
will automatically satisfy the AVL tree property. That means we always need to care for the first node whichever is 
not satisfying the AVL property on the path from insertion point to root and fix it. 

Types of Violations 

Let us assume the node that must be rebalanced is X. Since any node has at most two children, and a height imbalance 
requires that X’s two subtrees’ heights differ by two. We can easily observe that a violation might occur in four cases: 

1. An insertion into the left subtree of the left child of X. 

2. An insertion into the right subtree of the left child of X. 

3. An insertion into the left subtree of the right child of X. 

4. An insertion into the right subtree of the right child of X. 

Cases 1 and 4 are symmetric and easily solved with single rotations. Similarly, cases 2 and 3 are also symmetric and 
can be solved with double rotations (needs two single rotations). 

Single Rotations 

Left Left Rotation (LL Rotation) [Case-1]: In the below case, at node X, the AVL tree property is not satisfying. As 
discussed earlier, rotation does not have to be done at the root of a tree. In general, we start at the node inserted and 
travel up the tree, updating the balance information at every node on the path. 
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For example, in above figure, after the insertion of 7 in the original AVL tree on the left, node 9 becomes unbalanced. 
So, we do a single left-left rotation at 9. As a result we get the tree on the right. 

struct AVLTreeNode *SingleRotateLeft(struct AVLTreeNode *X){ 
struct AVLTreeNode *W = X—deft; 

X—deft = W-> right; 

W—>right = X; 

X—>height = max( Height(X—deft), Height(X—+right)) + 1; 

W—dieight = max( Height(W—deft), X—>height) + 1; 
returnW; /‘Newroot*/ 

} 

Time Complexity: 0(1). Space Complexity: 0(1). 

Right Right Rotation (RR Rotation) [Case-4]: In this case, the node X is not satisfying the AVL tree property. 




For example, in above figure, after the insertion of 29 in the original AVL tree on the left, node 15 becomes 
unbalanced. So, we dn a single right-right rotation at 15. As a result we get the tree on the right. 



struct AVLTreeNode *SingleRotateRight(struct AVLTreeNode *W) { 
struct AVLTreeNode *X = W—aright; 

W—>right = X—deft; 

X—deft = W; 

W—^height = max( Height(W—>right), Height(W—deft)) + 1; 
X—^height = max( Height(X—>right), W—^height) + 1; 
return X; 
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1 

Time Complexity: 0(1). Space Complexity: 0(1). 

Double Rotations 

Left Right Rotation (LR Rotation) [Case-2]: For case-2 and case-3 single rotation does not fix the problem. We need to 
perform two rotations. 



As an example, let us consider the following tree: Insertion of 7 is creating the case-2 scenario and right side tree is the 
one after double rotation. 



Code for left-right double rotation can be given as: 

struct AVLTreeNode *DoubleRotatewithLeft( struct AVLTreeNode *Z ){ 

Z—deft = SingleRotateRight( Z—deft); 
return SingleRotateLeft(Z); 

} 

Right Left Rotation (RL Rotation) [Case-3]: As similar to case-2, we need to perform two rotations for fixing this 
scenario. 
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As an example, let us consider the following tree: Insertion of 6 is creating the case-3 scenario and right side tree is the 
one after double rotation. 



Insertion into an AVL tree 

Insertion in AVL tree is very much similar to BST insertion. After inserting the element, we just need to check 
whether there is any height imbalance. If there is any imbalance, call the appropriate rotation functions. 

struct AVLTreeNode *Insert( struct AVLTreeNode ‘root, struct AVLTreeNode ‘parent, int data){ 
if( !root) { 

root = (struct AVLTreeNode*) malloc(sizeof (struct AVLTreeNode*)); 
if(!root) {printf(“Memory Error”); 
return; 

] 

else { root—>data = data; 

root—^height = 0; 

_ root—>left = root—>right = NULL; _ 
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] 

3 

else if( data < root—>data) { 

root—deft = Insert( root—deft, root, data); 
if( (Height( root—deft) - Height( root—>right)) == 2 ) { 
if( data < root—deft—> data ) 

root = SingleRotateLeft( root); 
else root = DoubleRotateLeft( root); 

} 

} 

else if( data > root—>data) { 

root^right = Insert( root—>right, root, data); 
if( ( Height( root—♦right) - Height( root—>left)) == 2 ) { 
if( data < root—bright—>data) 

root = SingleRotateRight( root); 
else root = DoubleRotateRight( root); 

) 

} 

/* Else data is in the tree already. We'll do nothing */ 

root—^height = max( Height(root—deft), Height(root—>right)) + 1; 

return root; 

} 

Time Complexity: O(n). Space Complexity: O (logri). 

Problems on AVL Trees 

Problem-71 Given a height h, give an algorithm for generating the HB( 0). 

Solution: As we have discussed, HB( 0) is nothing but generating full binary tree. In full binary tree the number of 
nodes with height h are: 2 h+1 - 1 (let us assume that the height of a tree with one node is 0). As a result the nodes can 
be numbered as: 1 to 2 h+1 — 1. 

struct BinarySearchTreeNode *BuildHB0(int h){ 
struct BinarySearchTreeNode ’temp; 
if(h == 0) 

return NULL; 

temp = (struct BinarySearchTreeNode *) malloc (sizeof(struct BinarySearchTreeNode)); 
temp—deft = BuildHBO (h-1); 

temp—>data = count++; //assume count is a global variable 

temp—>right = BuildHBO (h-1); 
return temp; 

} 

Time Complexity: O(n). Space Complexity: O (logri), where logn indicates the maximum stack size which is equal to 
height of tree. 

Problem-72 Is there any alternative way of solving Problem-71? 

Solution: Yes, we can solve following Mergesort logic. That means, instead of working with height, we can take the 
range. With this approach we do not need any global counter to be maintained. 

Struct BinarySearchTreeNode *BuildHB0(int 1, int r){ 
struct BinarySearchTreeNode *temp; 
int mid = l + r —\ 
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if( 1 > r) 

return NULL; 

temp = (struct BinarySearchTreeNode *) malloc (sizeof(struct BinarySearchTreeNode)); 

temp—>data = mid; 

temp—deft = BuildHBO(l, mid-1); 

temp—bright = BuildHBO(mid+l, r); 

return temp; 

} 

The initial call to BuildHB 0 function could be: BuildHB 0(1, 1 « h). 1 « h does the shift operation for calculating the 
2 h+1 - 1 . 


Time Complexity: O(n). Space Complexity: O(logn). Where logn indicates maximum stack size which is equal to 
height of the tree. 


Problem-73 Construct minimal AVL trees of height 0,1,2, 3,4, and 5. What is the number of nodes in a minimal 
AVL tree of height 6? 


Solution Let N(h) be the 
JV( 0) = 1 

]V(1) = 2 


number of nodes in a minimal AVL tree with height h. 

O 



N(h) = 1 + JV(/i - 1) + N(h - 2) 


N(2) = 1 + N( 1) + N( 0) 
= l + 2 + l = 4 



N( 3) = 14- N( 2) + N( 1) 
= l + 4 + 2 = 7 


N( 4) = 1 + N(3) + N( 2) 
= 1 + 7 + 4= 12 


N( 5) = 1 + N( 4) + N(3) 
= 1 + 12 + 7 = 20 



Problem-74 For the Problem-71 how many different shapes of a minimal AVL tree of height h can have? 

Solution: Let NS(h ) be the number of different shapes of a minimal AVL tree of height h. 

NS (O') = 1 0 

ns(i) = 2 y* c \ D 
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JVS(2) = 2 * NS(1) * NS( 0) 
= 2*2*1 = 4 


NS( 3) = 2 * NS(2) * NS( 1) 

= 2*4*1 = 8 


IVS(/i) = 2 * JVS(/i - 1) * NS(h - 2) 


Problem-75 Given a binary search tree check whether the tree is an AVL tree or not? 

Solution: Let us assume that IsAVL is the function which checks whether the given binary search tree is an AVL tree 
or not. IsAVL returns -1 if the tree is not an AVL tree. During the checks each node sends height of it to their parent, 
int IsAVL(struct BinarySearchTreeNode *root){ 
int left, right; 
if(!root) 
return 0; 

left = IsAVL(root—>left); 
if(left == -1) 

return left; 

right = IsAVL(root—>right); 
if(right == -1) 

return right; 
if(abs(left-right) > 1) 
return-1; 

return Max(left, right)+l; 

1 

Time Complexity: O(n). Space Complexity: O(n), 

Problem-76 Given a height h, give an algorithm to generate an AVL tree with min number of nodes. 

Solution: To get minimum number of nodes, fill one level with h — 1 and other with h — 2. 

struct AVLTreeNode *GenerateAVLTree(int h){ 
struct AVLTreeNode "temp; 
if(h == 0) 

return NULL; 

temp = (struct AVLTreeNode *)malloc (sizeof(struct AVLTreeNode)); 

temp—deft = GenerateAVLTree(h-l); 

temp—+data = count++; //assume count is a global variable 

temp—>right = GenerateAVLTree(h-2); 

temp—>height = temp—deft-»height+l; // or temp—>height = h; 
return temp; 

} 

Problem-77 Given an AVL tree with n integer items and two integers a and b, where a and b can be any integers 
with a <= b. Implement an algorithm to count the number of nodes in the range [a, b]. 
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_ , . root 

Solution: 


The idea is to make use of the recursive property of binary search trees. There are three cases to consider, whether the 
current node is in the range [a, b], on the left side of the range [a, b] or on the right side of the range [a, h]. Only 
subtrees that possibly contain the nodes will be processed under each of the three cases. 

int RangeCount(struct AVLNode *root, int a, int b) { 
if(root == NULL) 
return 0; 

else if(root—>data > b) 

return RangeCount(curr—deft, a, b); 
else if(root—>data < a) 

return RangeCount(root->right, a, b); 
else if(root—>data >= a && root—>data <= b) 

return RangeCount(root—deft, a, b) + RangeCount(root—bright, a, b) + 1; 

1 

The complexity is similar to in — order traversal of the tree but skipping left or right sub-trees when they do not 
contain any answers. So in the worst case, if the range covers all the nodes in the tree, we need to traverse all the n 
nodes to get the answer. The worst time complexity is therefore O(n). 

If the range is small, which only covers few elements in a small subtree at the bottom of the tree, the time complexity 
will be 0(h) =0 (logn), where h is the height of the tree. This is because only a single path is traversed to reach the 
small subtree at the bottom and many higher level subtrees have been pruned along the way. 

Note: Refer similar problem in BST. 

6.14 Other Variations in Trees 

In this section, let us enumerate the other possible representations of trees. In the earlier sections, we have seen AVL 
trees which is a binary search tree (BST) with balancing property. Now, let us see few more balanced binary search 
trees: Red-Black Trees and Splay Trees. 

Red-Black Trees 

In red-black trees each node is associated with extra attribute: the color, which is either red or black. To get 
logarithmic complexity we impose the following restrictions. 

Definition: A red-black tree is a binary search tree that satisfies the following properties: 

• Root Property: the root is black 

• External Property: every leaf is black 

• Internal Property: the children of a red node are black 

• Depth Property: all the leaves have the same black 
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As similar to AVL trees, if the Red-black tree becomes imbalanced then we perform rotations to reinforce the 
balancing property. With Red-black trees, we can perform the following operations in O (logn) in worst case, where 
n is the number of nodes in the trees. 

• Insertion, Deletion 

• Finding predecessor, successor 

• Finding minimum, maximum 

Splay Trees 

Splay-trees are BSTs with self-adjusting property. Another interesting property of splay-trees is: starting with empty 
tree, any sequence of K operations with maximum of n nodes takes O (Klogn) time complexity in worst case. 

Splay trees are easier to program and also ensures faster access to recently accessed items. As similar to AVL and Red- 
Black trees, at any point if the splay tree becomes imbalanced then we perform rotations to reinforce the balancing 
property. 

Splay-trees cannot guarantee the O (logn) complexity in worst case. But it gives amortized 0( logn) complexity. Even 
though individual operations can be expensive, any sequence of operations gets the complexity of logarithmic 
behavior. One operation may take more time (a single operation may take 0(n) time) but the subsequent operations 
may not take worst case complexity and on the average per operation complexity is O (logn). 

Augmented Trees 

In earlier sections, we have seen the problems like finding K th —smallest element in the tree and many other similar 
problems. For all those problems the worst complexity is O(n), where n is the number of nodes in the tree. To perform 
such operations in O (logn) augmented trees are useful. In these trees, extra information is added to each node and 
that extra data depends on the problem we are trying to solve. For example, to find K th —smallest in binary search 
tree, let us see how augmented trees solves the problem. Let us assume that we are using Red-Black trees as balanced 
BST (or any balanced BST) and augment the size information in the nodes data. For a given node X in Red-Black tree 
with a field size (A) equal to the number of nodes in the subtree and can be calculated as: 

size (A) = size (A -* left ) + size (A -» right)) + 1 

Example: With the extra size information, the augmented tree will look like: 



Kth-smallest operation can be defined as: 

struct BinarySearcTreeNode *KthSmallest (struct BinarySearcTreeNode *X, int K) { 
int r = size(X—>left) + 1; 
if(K == r) 

return X; 

if(K < r) 

return KthSmallest (X—deft, K); 

if(K > r) 
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return KthSmallest (X—►right, K-r); 

} 

Time Complexity: O (logri). Space Complexity: O (logri). 

Interval Trees 

Interval trees are also binary search trees and stores interval information in the node structure. That means, we 
maintain a set of n intervals [i 1; i 2 ] such that one of the intervals containing a query point Q (if any) can be found 
efficiently. Interval trees are used for performing range queries efficiently. 

Example: Given a set of intervals: S= {[2-5], [6-7], [6-10], [8-9], [12-15], [15-23], [25-30]]. A query with Q = 9 
returns [6,10] or [8,9] (assume these are the intervals which contains 9 among all the intervals). A query with Q = 
23 returns [15, 23], 



Query Line 






Construction of Interval Trees: Let us assume that we are given a set S of n intervals (also called segments). These n 
intervals will have 2n endpoints. Now, let us see how to construct the interval tree. 

Algorithm: 

Recursively build tree on interval set S as follows: 

• Sort the 2 n endpoints 

• Let X mid be the median point 



Intervals that are Intervals that are 

completely to the left of completely to the 

X m id i n n—>left right of X mid in n—bright 

Time Complexity for building interval trees: O (nlogri). Since we are choosing the median, Interval Trees will be 
approximately balanced. This ensures that, we split the set of end points up in half each time. The depth of the tree is 
O(logn). To simplify the search process, generally X mid is stored with each node. 
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Chapter 12 SEARCHING 


What is Searching? 

In computer science, searching is the process of finding an item with specified properties among a 
collection of items. The items may be stored as records in a database, simple data elements in arrays, 
text in files, nodes in trees, vertices and edges in graphs or may be elements of other search space. 


Why Searching? 

Searching is one of core computer science algorithms. We know that today’s computers store lot of 
information. To retrieve this information efficiently we need very efficient searching algorithms. 

There are certain ways of organizing the data which improves the searching process. That means, if 
we keep the data in some proper order then it is easy to search the required element. Sorting is one of 
the techniques for making the elements ordered. 

In this chapter we will see different searching algorithms. 

Types of Searching 

The following are the types of searches which we will be discussing in this book. 

• Unordered Linear Search 

• Sorted/Ordered Linear Search 

• Binary Search 

• Symbol Tables and Hashing 

• String Searching Algorithms: Tries, Ternary Search and Suffix Trees 

Unordered Linear Search 

Let us assume that given an array whose elements order is not known. That means the elements of 
the array are not sorted. In this case if we want to search for an element then we have to scan the 
complete array and see if the element is there in the given list or not. 

int UnsorteddLinearSearch (int A[], int n, int data) 
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for (int i = 0; i < n; i++) 

1 

if (A[i] == data) 
return i; 

} 

return -1; 

} 

Time complexity of this algorithm is 0(ri). This is because in the worst case we need to scan the 
complete array. 

Space complexity: 0(1). 

Sorted/Ordered Linear Search 

If the elements of the array are already sorted then in many cases we don’t have to scan the complete 
array to see if the element is there in the given array or not. In the below algorithm, it can be seen 
that, at any point if the value at A[i] is greater than the data to be searched then we just return — 1 
without searching the remaining array. 

int SortedLinearSearch(int A[], int n, int data) 

1 

for (int i = 0; i < n; i++) 

1 

if (A[i] == data) 
return i; 

else if(A[i] > data) 
return -1; 

3 

return -1; 

} 

Time complexity of this algorithm is 0(n). This is because in the worst case we need to scan the 
complete array. But in the average case it reduces the complexity even though the growth rate is 
same. 

Space complexity: 0(1). 

Note: For the above algorithm we can make further improvement by incrementing the index at faster 
rate (say, 2). This will reduce the number of comparisons for searching in the sorted list. 

Binary Search 
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If we consider searching of a word in a dictionary, in general we directly go some approximate page 
[generally middle page] start searching from that point. If the name that we are searching is same 
then we are done with the search. If the page is before the selected pages then apply the same process 
for the first half otherwise apply the same process to the second half. Binary search also works in the 
same way. The algorithm applying such a strategy is referred to as binary search algorithm. 

//Iterative Binary Search Algorithm 

int BinarySearchIterative[int A[], int n, int data) 

{ 

int low = 0; 
int high = n-1; 
while (low <= high) 

1 

mid = low + (high-low)/2; //To avoid overflow 

if (A[mid] == data) 
return mid; 
else if (A[mid] < data) 
low = mid + 1; 

else 

high = mid - 1; 

3 

return -1; 

} 

//Recursive Binary Search Algorithm 

int BinarySearchRecursive[int A[], int low, int high, int data) 

1 

int mid = low + (high-low)/2; //To avoid overflow 

if (A[mid] == data) 
return mid; 
else if (A[mid] < data) 

return BinarySearchRecursive (A, mid + 1, high, data); 

else 

return BinarySearchRecursive (A, low, mid - 1 , data); 
return -1; 

} 

Recurrence for binary search is T(ri) = + 0(1)- This is because we are always considering 

only half of the input list and throwing out the other half. Using Divide and Conquer master 
theorem, we get, T(n) — 0(logri). 
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Time Complexity: O(logri). 

Space Complexity: 0(1) [for iterative algorithm]. 

Comparing Basic Searching Algorithms 


Implementation 

Search-Worst Case 

Search-Avg. Case 

Unordered Array 

n 

n 

2 

Ordered Array 

logn 

logn 

Unordered List 

n 

n 

2 

Ordered List 

n 

n 

2 

Binary Search (arrays) 

logn 

logn 

Binary Search Trees (for skew trees) 

n 

logn 


Note: For discussion on binary search trees refer Trees chapter. 

Symbol Tables and Hashing 

Refer Symbol Tables and Hashing chapters. 

String Searching Algorithms 

Refer String Algorithms chapter. 

Problems on Searching 

Problem-1 Given an array of n numbers. Give an algorithm for checking whether there are any 
duplicated elements in the array or not? 

Solution: This is one of the simplest problems. One obvious answer to this is, exhaustively searching 
for duplicated in the array. That means, for each input element check whether there is any element 
with same value. This we can solve just by using two simple yon loops. The code for this solution can 
be given as: 

void CheckDuplicatesBruteForce(int A[], int n) 

1 

int i = 0, j=0; 
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for(i = 0; i < n; i++) 

1 

for(j = i+1; j < n; j++) 

1 

if(A[i] == A[j]) 

1 

printf(“Duplicates exist: %d”, A[i]); 
return; 

3 

3 


3 

printf(“No duplicates in given array.”); 

} 

Time Complexity: 0(n 2 ). This is because of two nested for loops. 

Space Complexity: 0(1). 

Problem-2 Can we improve the complexity of Problem- l’s solution? 

Solution: Yes. Sort the given array. After sorting all the elements with equal values come adjacent. 
Now, just do another scan on this sorted array and see if there are elements with same value and 
adjacent. 

void CheckDuplicatesBruteForce(int A[], int n) 

1 

//sort the array 
Sort(A, n); 

for(int i = 0; i < n-1; i++) 

1 

if(A[i] == A[i+1]) 

1 

printf(“Duplicates exist: %d”, A[i]); 
return; 

} 

3 

printf(“No duplicates in given array.”); 

} 

Time Complexity: Ofnlogn). This is because of sorting. 

Space Complexity: 0(1). 

Problem-3 Is there any other way of solving the Problem-1? 
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Solution: Yes, using hash table. Hash tables are a simple and effective method to implement 
dictionaries. Average time to search for an element is 0(1), while worst-case time is 0(n). Refer 
Hashing chapter for full details on hashing algorithms. 

For example, consider the array, A = (3,2,1, 2,2,3). Scan the input array and insert the elements 
into the hash. For inserted element, keep the counter as 1. This indicates that the corresponding 
element has occurred already. For the given array, the hash table will look like (after inserting first 
three elements 3, 2 and 1): 

> 1 


1 


> 1 


If we try inserting 2, since the counter value of 2 is already 1, then we can say the element is 
appearing twice. 


3 


2 


1 


Time Complexity: 0(n). 
Space Complexity: 0 ( n ). 


Problem-4 Can we further improve the complexity of Problem-1 ’s solution? 


Solution: Let us assume that the array elements are positive numbers and also all the elements are in 
the range 0 to n — 1. For each element A[i], we go to the array element whose index is A[i], That 
means we select A [A [t]] and mark - 4[4[t]] (that means we negate the value at 4[4[i]]). We continue 
this process until we encounter the element whose value is already negated. If one such element 
exists then we say duplicate elements exist in the given array. As an example, consider the array, 
A = (3,2,1,2,2,3). 


Initially, 

3 

2 

1 

2 

2 

3 






0 

1 

2 

3 


4 

5 





At step-1, negate A[abs(A[0])], 


3 

2 

1 

-2 

2 

3 







0 

1 

2 

3 

4 

5 

At step-2, negate A[abs(A[l])], 


3 

2 

-1 

-2 

2 

3 







0 

1 

2 

3 

4 

5 
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At step-3, negate A[abs(A[2])], 


At step-4, negate A[abs(A[3])], 


3 

-2 

-1 

-2 

2 

3 

0 

1 

2 

3 

4 

5 

3 

- 2 

-1 

-2 

2 

3 

0 

1 

2 

3 

4 

5 


At step-4, we can observe that d[abs(i4[3])] is already negative. That means we have encountered 
the same value twice. 


The code for this algorithm can be given as: 

void CheckDuplicates(int A[], int n) 

1 

int i = 0; 

for(i = 0; i < n; i++) 

1 

if(A[abs(A[i])] < 0) 

1 

printf(“Duplicates exist:%d”, A[i]); 
return; 

} 

else 

1 

A[A[i]] = -A[A[i]]; 

} 

} 

printf(“No duplicates in given array.”); 

} 

Time Complexity: 0(n). Since, only one scan is required. 

Space Complexity: 0(1). 

Note: 

• This solution does not work if the given array is read only. 

• This solution will work only if all the array elements are positive. 

• If the elements range is not in 0 to n — 1 then it may give exceptions. 

Problem-5 Given an array of n numbers. Give an algorithm for finding the first element in the 
array which is repeated? 
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For example, consider the array, A = (3,2,1,2,2,3). In this array the first repeated number is 3 
(not 2). That means, we need to return the first element among the repeated elements. 

Solution: We can use the brute force solution of Problem-1. Because it for each element it checks 
whether there is a duplicate for that element or not. So, whichever element duplicates first then that 
element is returned. 

Problem-6 For Problem-5, can we use sorting technique? 

Solution: No. For proving the failed case, let us consider the following array. For example, A = 
(3, 2,1, 2, 2, 3). Then after sorting we get A = (1, 2, 2, 2, 3, 3). In this sorted array the first repeated 
element is 2 but the actual answer is 3. 

Problem-7 For Problem-5, can we use hashing technique? 

Solution: Yes. But the simple technique which we used for Problem-3 will not work. For example, if 
we consider the input array as A — (3, 2,1, 2,3), in this case the first repeated element is 3 but using 
our simple hashing technique we the answer as 2. This is because of the fact that 2 is coming twice 
before 3. Now let us change the hashing table behavior so that we get the first repeated element. 

Let us say, instead of storing 1 value, initially we store the position of the element in the array. As a 
result the hash table will look like (after inserting 3, 2 and 1): 


* 1 


♦ 2 


*- 3 

Now, if we see 2 again, we just negate the current value of 2 in the hash table. That means, we make 
its counter value as —2. The negative value in the hash table indicates that we have seen the same 
element two times. Similarly, for 3 (next element in input) also, we negate the current value of hash 
table and finally the hash table will look like: 


3 


2 


1 




3 



2 


i 




-1 

-2 

3 


After scanning the complete array, we scan the hash table and return the highest negative indexed 
value from it (i.e., —1 in our case). The highest negative value indicates that we have seen that 
element first (among repeated elements) and also repeating. 
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What if the element is repeated more than two times? 

In this case, what we can do is, just skip the element if the corresponding value i already negative. 

Problem-8 For Problem-5, can we use Problem-3’s technique (negation technique)? 

Solution: No. As a contradiction example, for the array A = (3,2,1, 2, 2, 3} the first repeated 
element is 3. But with negation technique the result is 2. 

Problem-9 Given an array of n elements. Find two elements in the array such that their sum is 
equal to given element K7 

Solution: Brute Force Approach 

One simple solution to this is, for each input element check whether there is any element whose sum 
is K. This we can solve just by using two simple for loops. The code for this solution can be given as: 

void BruteForceSearchjint A[], int n, int K) 

{ 

int i = 0, j = 0; 
for (i = 0; i < n; i++) 

1 

for(j = i; j < n; j++) 

( 

if(A[i]+A[j] == K) 

1 

printf(“Items Found:°/od %d”, i, j); 
return; 

} 

} 

} 

printf(“Items not found: No such elements”); 

} 

Time Complexity: 0(n 2 ). This is because of two nested for loops. 

Space Complexity: 0(1). 

Problem-10 Does the solution of Problem-9 works even if the array is not sorted? 

Solution: Yes. Since we are checking all possibilities, the algorithm ensures that we get the pair of 
numbers if they exist. 

Problem-11 For the Problem-9, can we improve the time complexity? 


422 


Searching | Problems on Searching 






www.CareerMonk.com 


Data Structures and Algorithms Made Easy 


Solution: Yes. Let us assume that we have sorted the given array. This operation takes 0(n logri). On 
the sorted array, maintain indices loIndex = 0 and hilndex = n — 1 and compute A\loIndex] + 
A[hilndex], If the sum equals K, then we are done with the solution. If the sum is less than K, 
decrement hilndex , if the sum is greater than K, increment lolndex. 

void Search[int A[], int n, int K) 

1 

int i, j, temp; 

Sort(A, n); 

for(i = 0, j = n-1; i < j; ) 

1 

temp =A[i]+A[j]; 
if (temp == K) 

1 

printf("Elements Found: %d %d", i, j); 
return; 

} 

else if (temp < K) 
i= i+ 1; 

else 

j = j -1; 

} 

return; 

} 

Time Complexity: Ofnlogri). If the given array is already sorted then the complexity is 0(n). 

Space Complexity: 0(1). 

Problem-12 Is there any other way of solving the Problem-9? 

Solution: Yes, using hash table. 

Since our objective is to find two indexes of the array whose sum is K. Let us say those indexes are X 
and Y. That means, A[X] + A[Y] = K. 

What we need is, for each element of the input array A[X], check whether K — A[X] also exists in 
input array. Now, let us simplify that searching with hash table. 

Algorithm 

• For each element of the input array, insert into the hash table. Let us say the current element 
is A [X]. 

• Before proceeding to the next element we check whether K - A [A] also exists in hash table or 
not. 
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• Existence of such number indicates that we are able to find the indexes. 

• Otherwise proceed to the next input element. 

Time Complexity: 0(n). 

Space Complexity: 0(n). 

Problem-13 Given an array A of n elements. Find three elements, i,j and k in the array such 
that A[i] 2 + A[j] 2 = A[k] 2 ? 

Solution: 

Algorithm 

• For each array index i compute A[i] 2 and store in array. 

• Now, the problem reduces to finding three indexes , i,j and k such that A[i] + A\j\ = A\k\. 
This is same as that of Problem-9. 

Problem-14 Two elements whose sum is closest to zero 

Given an array with both positive and negative numbers. We need to find the two elements such 
that their sum is closest to zero. For the below array, algorithm should give —80 and 85. 

Example: 1 60 - 10 70 - 80 85 

Solution: Brute Force Solution. 

For each element, find the sum of it with every other element in the array and compare sums. 
Finally, return the minimum sum. 

void TwoElementsWithMinSum(int A[], int n) 

1 

int inv_count = 0; 

int i, j, min_sum, sum, min_i, min_j; 

if(n < 2) 

1 

printf("Invalid Input"); 
return; 

3 

/* Initialization of values 7 
min_i = 0; 
min_j = 1; 

min_sum = A[0] + A[l]; 
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for(i= 0; i < n - 1; i ++) 

1 

for(j = i + 1; j < n; j++) 

1 

sum = A[i] + A[j]; 
if(abs(min_sum) > abs(sum)) 

1 

min_sum = sum; 
min_i = i; 
min_j = j; 

3 

3 

3 

printf(" The two elements are Tod and Tod", arr[min_i], arr[min_j]); 

} 

Time complexity: 0(n 2 ). 

Space Complexity: 0(1). 

Problem-15 Can we improve the time complexity of Problem-14? 

Solution: Use Sorting. 

Algorithm 

1) Sort all the elements of the given input array. 

2) Find the two elements on either side of zero (if they are all positive or all negative then we 
are done with the solution) 

3) If the one is positive and other is negative then add the two values at those positions. If the 
total is positive then increment the negative index, if it is negative then increment the 
positive index. If it is zero then stop. 

4) loop step (3) until we hit a zero total or reached the end of array. Store the best total as you 

go- 

Time Complexity: Ofnlogri), for sorting. 

Problem-16 Given an array of n elements. Find three elements in the array such that their sum is 
equal to given element K1 

Solution: Brute Force Approach. 

The default solution to this is, for each pair of input elements check whether there is any element 
whose sum is K. This we can solve just by using three simple for loops. The code for this solution can 
be given as: 
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void BruteForceSearch[int A[], int n, int data) 

1 

int i = 0, j = 0, k = 0; 
for (i = 0; i < n; i++) 

1 

for(j = i+1; j < n; j++) 

1 

for(k = j+1; k < n; k++) 

1 

if(A[i] + A[j] + A[k]== data) 

{ 

printf(“Items Found:%d %d %d”, i, j, k); 
return; 

} 

} 

3 

3 

printf(“Items not found: No such elements”); 

} 

Time Complexity: 0(n 3 ). This is because of three nested for loops. 

Space Complexity: 0(1). 

Problem-17 Does the solution of Problem-16 works even if the array is not sorted? 

Solution: Yes. Since we are checking all possibilities, the algorithm ensures that we can find three 
numbers whose sum is K if they exist. 

Problem-18 Can we use sorting technique for solving Problem-167 

Solution: Yes. 

void Search[int A[], int n, int data) 

1 

int i, j; 

Sort(A, n); 

for(k = 0; k < n; k++) 

1 

for(i = k + 1, j = n-1; i < j; ) 

1 

if (A[k] + A[i] + A[j] == data) 

[ 

printf(“Items Found:%d %d %d”, i, j, k); 
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return; 

} 

else if (A[k] + A[i] + A[j] < data) 
i= i+ 1; 

else 


return; 

} 

Time Complexity: Time for sorting + Time for searching in sorted list = 0(nlogri) + 0(n 2 ) ~ 
0(n 2 ). This is because of two nested for loops. 

Space Complexity: 0(1). 

Problem-19 Can we use hashing technique for solving Problem-167 

Solution: Yes. Since our objective is to find three indexes of the array whose sum is K. Let us say 
those indexes are X, Y and Z. That means, A[X] + A[Y] + A[Z] = K. 

Let us assume that we have kept all possible sums along with their pairs in hash table. That means the 
key to hash table is K — A[X] and values for K — A\X\ are all possible pairs of input whose sum is 
K - A[X]. 

Algorithm 

• Before starting the searching, insert all possible sums with pairs of elements into the hash 
table. 

• For each element of the input array, insert into the hash table. Let us say the current element 
is A [A]. 

• Check whether there exists a hash entry in the table with key: K — A[X], 

• If such element exists then scan the element pairs of K — A\X\ and return all possible pairs 
by including A [A] also. 

• If no such element exists (with K — A[X] as key) then go to next element. 

Time Complexity: Time for storing all possible pairs in Hash table + searching = 0(n 2 ) + 0(n 2 ) « 
0(n 2 ). 

Space Complexity: 0(n). 

Problem-20 Given an array of n integers, the 3 — sum problem is to determine find three integers 
whose sum is closest to zero. 

Solution: This is same as that of Problem-16. In this case, K value is zero. 
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Problem-21 Given an array of n numbers. Give an algorithm for finding the element which 
appears maximum number of times in the array? 

Solution: Brute Force. 

One simple solution to this is, for each input element check whether there is any element with same 
value and for each such occurrence, increment the counter. Each time, check the current counter 
with the max counter and update it if this its value is greater than max counter. This we can solve just 
by using two simple for loops. The code for this solution can be given as: 

int CheckDuplicatesBmteForce(int A[], int n) 

1 

int i = 0, j=0; 

int counter =0, max=0; 

for(i = 0; i < n; i++) 

1 

counter=0; 
for(j = 0; j < n; j++) 

1 

if(A[i] == A[j]) 

counter++; 

3 

if (counter > max) 

max = counter; 

} 

return max; 

} 

Time Complexity: 0(n 2 ). This is because of two nested for loops. 

Space Complexity: 0(1). 

Problem-22 Can we improve the complexity of Problem-21 solution? 

Solution: Yes. Sort the given array. After sorting all the elements with equal values come adjacent. 
Now, just do another scan on this sorted array and see which element is appearing maximum number 
of times. 

Time Complexity: 0(nlogri). (for sorting). 

Space Complexity: 0(1). 

Problem-23 Is there any other way of solving Problem-21? 
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Solution: Yes, using hash table. For each element of the input keep track of how many times that 
element appeared in the input. That means the counter value represents the number of occurrences 
for that element. 

Time Complexity: 0(n). 

Space Complexity: 0(n). 

Problem-24 For Problem-21, can we improve the time complexity? Assume that the elements 
range is o to n — 1. That means all the elements are within this range only. 

Solution: Yes. We solve this problem in two scans. We cannot use the negation technique of 
Problem-3 for this problem because of number of repetitions. 

In the first scan, instead of negating we add the value n. That means for each of occurrence of an 
element we add the array size to that element. 

In the second scan we check the element value by dividing it with n and we return the element 
whichever gives the maximum value. The code based on this method is given below. 

void MaxRepititions(int A[], int n) 

1 

int i = 0; 

int max = 0; 

for(i = 0; i < n; i++) 

1 

A[A[i]%n] +=n; 

} 

for(i = 0; i < n; i++) 

1 

if(A[i]/n > max) 

1 

max = A[i]/n; 
max =i; 

} 

} 

return max; 

} 

Note: 

• This solution does not work if the given array is read only. 

• This solution will work only if the array elements are positive. 

• If the elements range is not in 0 to n — 1 then it may give exceptions. 

Time Complexity: 0(n). Since no nested for loops are required. 
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Space Complexity: 0(1). 

Problem-25 Let A be an array of n distinct integers. Suppose A has the following property: there 
exists an index 1 < k < n such that A[l],... ,A[k] is an increasing sequence and A[k + 
1],..., .d[n] is a decreasing sequence. Design and analyze an efficient algorithm for finding k. 

Similar question: 

Lets us assume that the given array is sorted but starts with negative numbers and ends with 
positive numbers [such functions are called monotonically increasing function]. In this array find 
the starting index of the positive numbers. Let us assume that we know the length of the input 
array. Design a O(logri) algorithm. 

Solution: We use a variant of the binary search. 

int Search (int A[], int first, int last) 

1 

int first = 0; 
int last = n-1; 
int mid; 

while (first <= last) 

1 

// if the current array has size 1 
if(first == last) 

return Affirst]; 

// if the current array has size 2 
else if(first == last-1) 

return max(A[first], A[last]); 

// if the current array has size 3 or more 
else 
1 

mid = first + (last-first)/2; 

if(A[mid-l] < A[mid] && A[mid] > A[mid+1]) 
return A[mid]; 

else if(A[mid-l] < A[mid] && A[mid] < A[mid+1]) 
first = mid+1; 

else if(A[mid-l] > A[mid] && A[mid] > A[mid+1]) 
last = mid-1; 

else 

return INT_MIN ; 

] // end of else 
] // end of while 
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} 

The recursion equation is T(n) = 2T(n/2) + c. Using master theorem, we get O(logn). 

Problem-26 If we don't know n, how do we solve the Problem-25? 

Solution: Repeatedly compute A[l], A[2], A[4], A[8], A[16], and so on until we find a value of n such 
thatA[n] > 0. 

Time Complexity: 0(logn), since we are moving at the rate of 2. 

Refer Introduction to Analysis of Algorithms chapter for details on this. 

Problem-27 Given an input array of size unknown with all l's in the beginning and 0's in the 
end. Find the index in the array from where 0's start. Consider there are millions of l's and 0's 
in the array. E.g. array contents 1111111.1100000.0000000. 

Solution: This problem is almost similar to Problem-26. Check the bits at the rate of 2 K where 
k = 0,1,2 .... 

Since we are moving at the rate of 2, the complexity is 0(logn). 

Problem-28 Given a sorted array of n integers that has been rotated an unknown number of times, 
give a 0 ( log n) algorithm that finds an element in the array. 

Example: Find 5 in array (15 16 19 20 25 1 3 4 5 7 10 14) 

Output: 8 (the index of 5 in the array) 

Solution: Let us assume that the given array is A[]. Using solution of Problem-25, with extension. 
The below function FindPivot returns the k value (let us assume that this function return the index 
instead of value). Find the pivot point, divide the array in two sub-arrays and call binary search. 

The main idea for finding pivot is - for a sorted (in increasing order) and pivoted array, pivot element 
is the only element for which next element to it is smaller than it. Using above criteria and binary 
search methodology we can get pivot element in 0 ( logn ) time 

Algorithm 

1) Find out pivot point and divide the array in two sub-arrays. 

2) Now call binary search for one of the two sub-arrays. 

a. if element is greater than first element then search in left subarray 

b. else search in right subarray 

3) If element is found in selected sub-array then return index else return —1. 

int FindPivot(int A[], int start, int finish) 

1 
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if(finish - start == 0) 
return start; 

else if(start) == finish - 1) 

1 

if (A[start] >= A[finish]) 
return start; 

else 

return finish; 

3 

else 

1 

mid = start + (finish-start)/2; 
if (Afmid] >= A[mid +1]) 

return FindPivot(A, start, mid); 

else 

return FindPivot(A, mid, finish); 

3 

} 

int Search(int A[], int n, int x) 

1 

int pivot = FindPivot(A, 0, n-1); 
if(A[pivot] == x) 

return pivot; 
if(A[pivot] <= x) 

return Binary Search (A, 0, pivot-1, x); 

else 

return Binary Search (A, pivot+1, n-1, x); 

} 

int BinarySearch(int A[], int low, int high, int x) 

1 

if(high >= low) 

1 

int mid = low + (high - low)/2; 

if(x == A[mid]) 

return mid; 
if(x > A[mid]) 

return BinarySearch(A, (mid + 1), high, x); 

else 

return BinarySearch(A, low, (mid -1), x); 

3 

/‘Return -1 if element is not found*/ 
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return -1; 


} 


Time complexity: O(logn). 


Problem-29 For Problem-28, can we solve in one scan? 


Solution: Yes. 


int BinarySearchRotated(int A[], int start, int finish, int data) 

1 


int mid; 

if (start > finish) 
return -1; 

mid = start + (finish - start) / 2; 

if (data == A[mid]) 
return mid; 

else if (Afstart] <= A[mid]) 

{ // start half is in sorted order. 

if (data >= Afstart] && data < Afmid]) 

return BinarySearchRotated(A, start, mid - 1, data); 

else 

return BinarySearchRotated(A, mid + 1, finish, data); 


] 

else 

{ // Afmid] <= Affinish], finish half is in sorted order. 


if (data > Afmid] && data <= Affinish]) 

return BinarySearchRotated(A, mid + 1, finish, data); 


else 

return BinarySearchRotated(A, start, mid - 1, data); 


} 


} 


Time complexity: O(logri). 

Problem-30 Bitonic search. 

An array is bitonic if it is comprised of an increasing sequence of integers followed immediately 
by a decreasing sequence of integers. Given a bitonic array A of n distinct integers, describe how 
to determine whether a given integer is in the array in 0(logn ) steps. 

Solution: This is same as Problem-25. 
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Problem-31 Yet, other way of asking Problem-25? 

Let d[] be an array that starts out increasing, reaches a maximum, and then decreases. Design an 
O(logn) algorithm to find the index of the maximum value. 

Problem-32 Give an 0 (nlogri) algorithm for computing the median of a sequence of n integers. 

Solution: Sort and return element at n/2. 

Problem-33 Given two sorted lists of size m and n, find the median of all elements in 0(log (m + 
n)) time. 

Solution: Refer Divide and Conquer chapter. 

Problem-34 Give a sorted array A of n elements, possibly with duplicates, find the index of the 

first occurrence of a number in 0 ( logn ) time. 

Solution: To find the first occurrence of a number we need to check for the following condition. 
Return the position if any one of the following is true: 

mid == low && A[mid] == data || A[mid] == data && A[mid-1] < data 

int BinarySearchFirstOccurrence(int A[], int n, int low, int high, int data) 

{ 

int mid; 
if (high >= low) 

1 

mid = low + (high-low) / 2; 

if ((mid == low && A[mid] == data) || (A[mid] == data && A[mid - 1] < data)) 
return mid; 

// Give preference to left half of the array 
else if (A[mid] >= data) 

return BinarySearchFirstOccurrence (A, n, low, mid - 1, data); 

else 

return BinarySearchFirstOccurrence (A, n, mid + 1, high, data); 

} 

return -1; 

} 

Time Complexity: 0(logn). 

Problem-35 Give a sorted array A of n elements, possibly with duplicates, find the index of the last 

occurrence of a number in 0(logn ) time. 
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Solution: To find the last occurrence of a number we need to check for the following condition. 
Return the position if any one of the following is true: 

mid == high && A[mid] == data jj A[mid] == data && A[mid+1] > data 

int BinarySearchLastOccurrence(int A[], int n, int low, int high, int data) 

{ 

int mid; 
if (high >= low) 

1 

mid = low + (high-low) / 2; 

if ((mid == high && A[mid] == data) |j (A[mid] == data && A[mid + 1] > data)) 
return mid; 

// Give preference to right half of the array 
else if (A[mid] <= data) 

return BinarySearchLastOccurrence (A, n, mid + 1, high, data); 

else 

return BinarySearchLastOccurrence (A, n, low, mod - 1, data); 

3 

return -1; 

} 

Time Complexity: O(logri). 

Problem-36 Give a sorted array of n elements, possibly with duplicates, find the number of 
occurrences of a number. 

Solution: Brute For Approach. 

Do a linear search over the array and increment count as and when we find the element data in the 
array. 

int LinearSearchCount(int A[], int n, int data) 

1 

int count = 0; 

for (int i = 0; i < n; i++) 

1 

if (a[i] == k) 

count++; 

3 

return count; 
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} 


Time Complexity: 0(n). 

Problem-37 Can we improve the time complexity of Problem-36? 

Solution: Yes. We can solve this by using one binary search call followed by another small scan. 


Algorithm 


• Do a binary search for the data in the array. Let us assume its position be K. 

• Now traverse towards left from K and count the number of occurrences of data. Let this 
count be leftCount. 

• Similarly, traverse towards right and count the number of occurrences of data. Let this count 
be rightCount. 

• Total number of occurrences = leftCount + 1 + rightCount 

Time Complexity - O(logn + S) where S is the number of occurrences of data. 

Problem-38 Is there any alternative way of solving the Problem-36. 7 


Solution: 


Algorithm 


• Find the first occurrence of data and call its index as firstOccurrence (for algorithm refer 
Problem-34) 

• Find the last occurrence of data and call its index as lastOccurrence (for algorithm refer 
Problem-35) 

• Return lastOccurrence - firstOccurrence + 1 
Time Complexity = O(logn + logn) — O(logn). 

Problem-39 What is the next number in the sequence 1,11,21 and why? 

Solution: Read the given number loudly. This is just a fun problem. 

One one 
Two Ones 

One two, one one - ^ 1211 

So answer is, the next number is the representation of previous number by reading it loudly. 
Problem-40 Finding second smallest number efficiently. 
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Solution: We can construct a heap of the given elements using up just less than n comparisons (Refer 
Priority Queues chapter for algorithm). Then we find the second smallest using logn comparisons 
for the GetMax() operation. Overall, we get n + Logn + constant. 

Problem-41 Is there any other solution for Problem-40? 

Solution: Alternatively, split the n numbers into groups of 2, perform n/ 2 comparisons successively 
to find the largest using a tournament-like method. The first round will yield the maximum in n — 1 
comparisons. The second round will be performed on the winners of the first round and the ones the 
maximum popped. This will yield logn — 1 comparisons for a total of n + Ig n — 2. The above 
solution is called as tournament problem. 

Problem-42 An element is a majority if it appears more than n/2 times. Give and algorithm takes 
an array of n element as argument and identifies a majority (if it exists). 

Solution: The basic solution is to have two loops and keep track of maximum count for all different 
elements. If maximum count becomes greater than n/2 then break the loops and return the element 
having maximum count. If maximum count doesn’t become more than n/2 then majority element 
doesn’t exist. 

Time Complexity: 0(n 2 ). 

Space Complexity: 0(1). 

Problem-43 Can we improve the Problem-42 time complexity to O(nlogn)'! 

Solution: Using binary search we can achieve this. 

Node of the Binary Search Tree (used in this approach) will be as follows. 

struct TreeNode 

{ 

int element; 
int count; 

struct TreeNode ‘left; 
struct TreeNode ‘right; 

}BST; 

Insert elements in BST one by one and if an element is already present then increment the count of 
the node. At any stage, if count of a node becomes more than n/2 then return. The method works 
well for the cases where n/2 + 1 occurrences of the majority element is present in the starting of the 
array, for example (1, 1,1, 1,1, 2, 3, and 4). 
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Time Complexity: If a binary search tree is used then worst time complexity will be 0(n 2 ). If a 
balanced-binary-search tree is used then Ofnlogri). 

Space Complexity: 0(n). 

Problem-44 Is there any other of achieving 0 (nlogri) complexity for the Problem-42? 

Solution: Sort the input array and scan the sorted array to find the majority element. 

Time Complexity: 0(nlogri). 

Space Complexity: 0(1). 

Problem-45 Can we improve the complexity for the Problem-42? 

Solution: If an element occurs more than n /2 times in A then it must be the median of A. But, the 
reverse is not true, so once the median is found, we must check to see how many times it occurs in A. 
We can use linear selection which takes 0(n) time (for algorithm refer Selection Algorithms 
chapter). 

int CheckMajority(int A[], in n) 

1 

1) Use linear selection to find the median m of A. 

2) Do one more pass through A and count the number of occurrences of m. 

a. If m occurs more than n/2 times then return true; 

b. Otherwise return false. 

} 

Problem-46 Is there any other way of solving the Problem-42? 

Solution: Since only one element is repeating, we can use simple scan of the input array by keeping 
track of count for the elements. If the count is 0 then we can assume that the element is coming first 
time otherwise that the resultant element. 

int MajorityNum(int[] A, int n) 

1 

int majNum, count; 
element = -1; count = 0; 
for(int i = 0; i < n; i++) 

1 

// If the counter is 0 then set the current candidate to majority num and 
// we set the counter to 1. 
if(count == 0) 

1 

element = A[i]; 
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count = 1; 


} 

else if(element == A[i]) 


// Increment counter If the counter is not 0 and 
// element is same as current candidate. 
count++; 


else 


// Decrement counter If the counter is not 0 and 
// element is different from current candidate, 
count—; 


3 


return element; 


} 


Time Complexity: 0(n). 

Space Complexity: 0(1). 

Problem-47 Given an array of 2n elements of which n elements are same and the remaining n 
elements are all different. Find the majority element. 

Solution: The repeated elements will occupy half the array. No matter what arrangement it is, only 
one of the below will be true, 

• All duplicate elements will be at a relative distance of 2 from each other. Ex: n, 1, n, 100, n, 
54, n... 

• At least two duplicate elements will be next to each other 

Ex: n,n, 1,100, n, 54, n,.... 


n, 1 ,n, n, n, 54,100 ... 
1,100,54,n,n, n, n.... 


So, in worst case, we need will two passes over the array, 

First Pass: compare A[i] and A[i + 1] 

Second Pass: compare A\l\ and A[i + 2] 

Something will match and that's your element. 

This will cost 0(n) in time and 0(1) in space. 
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Problem-48 Given an array with 2n + 1 integer elements, n elements appear twice in arbitrary 
places in the array and a single integer appears only once somewhere inside. Find the lonely 
integer with 0(n) operations and 0(1) extra memory. 

Solution: Since except one element all other elements are repeated. We know that AXOR A = 0. 
Based on this if we XOR all the input elements then we get the remaining element. 

int solution(int* A) 

1 

int i, res; 

for (i = res = 0; i < 2n+l; i++) 
res = res A A[i]; 
return res; 

} 

Time Complexity: 0(n). 

Space Complexity: 0(1). 

Problem-49 Throwing eggs from an n-story building. 

Suppose that we have an n story building and a set of eggs. Also assume that an egg breaks if it is 
thrown off floor F or higher, and will not break otherwise. Devise a strategy to determine the 
floor F, while breaking 0 (logit) eggs. 

Solution: Refer Divide and Conquer chapter. 

Problem-50 Local minimum of an array. 

Given an array A of n distinct integers, design an 0(logn) algorithm to find a local minimum: an 
index i such that A[i — 1] < A[i] < A[i + 1], 

Solution: Check the middle value i4[n/2], and two neighbors d[n/2 — 11 and A\n/2 + 1], If 4|n/2| 
is local minimum, stop; otherwise search in half with smaller neighbor. 

Problem-51 Give an n x n array of elements such that each row is in ascending order and each 
column is in ascending order, devise an 0 (n) algorithm to determine if a given element x in the 
array. You may assume all elements in the n x n array are distinct. 

Solution: Let us assume that the given matrix is A [n] [n]. Start with the last row, first column [or first 
row - last column]. If the element we are searching for is greater than the element at i4[l][n], then 
the column 1 can be eliminated. If the search element is less than the element at j4[l][n], then the 
last row can be completely eliminated. Now, once the first column or the last row is eliminated, now, 
start over the process again with left-bottom end of the remaining array. In this algorithm, there 
would be maximum n elements that the search element would be compared with. 

Time Complexity: 0(n). This is because we will traverse at most 2 n points. 
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Space Complexity: 0(1). 


Problem-52 Given an nxn array a of n 2 numbers, Give an 0 (n) algorithm to find a pair of 


indices i and j such that 4[i][/] < A[i + l][/]M[i][/] < A[i]\j + l],d[i][/] < A[i — 1][/], and 
A[i]\j] < A[i]\j - 1], 


Solution: This problem is same as Problem-51. 

Problem-53 Given nxn matrix, and in each row all l’s are followed 0’s. Find row with maximum 
number of 0’s. 

Solution: Start with first row, last column. If the element is 0 then move to the previous column in 
the same row and at the same time increase the counter to indicate the maximum number of 0’s. If 
the element is 1 then move to the next row in the same column. Repeat this process until we reach 
last row, first column. 

Time Complexity: 0(2ri) ~ 0(n) (very much similar to Problem-51). 

Problem-54 Given an input array of size unknown with all numbers in the beginning and special 
symbols in the end. Find the index in the array from where special symbols start. 

Solution: Refer Divide and Conquer chapter. 

Problem-55 Finding the Missing Number 

We are given a list of n — 1 integers and these integers are in the range of 1 to n. There are no 
duplicates in list. One of the integers is missing in the list. Given an algorithm to find the missing 
integer. 

Example: 


I/P [1, 2,4, 6, 3,7,8] 
O/P 5 


Solution: Use sum formula 

1) Get the sum of numbers, sum — n * (n + l)/2 

2) Subtract all the numbers from sum and you will get the missing number. 

Time Complexity: 0(n), this is because we need to scan the complete array. 

Problem-56 In Problem-55, if the sum of the numbers goes beyond maximum allowed integer, 
then there can be integer overflow and we may not get correct answer. Can we solve this 
problem? 


Solution: 
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1) XOR all the array elements, let the result of XOR be X. 

2) XOR all numbers from 1 to n, let XOR be Y. 

3) XOR of X and Y gives the missing number. 

int FindMissingNumber(int A[], int n) 

1 

int i, X, Y; 

for (i = 0; i < 9; i ++) 

X A =A[i]; 

for (i = 1; i <= 10; i ++) 

Y A = i; 

//In fact, one variable is enough, 
return X A Y; 

} 

Time Complexity: 0(ri), this is because we need to scan the complete array. 

Problem-57 Find the Number Occurring Odd Number of Times 

Given an array of positive integers, all numbers occurs even number of times except one number 
which occurs odd number of times. Find the number in 0(n) time & constant space. 

Example: 

I/P = [1,2,3,2,3,1,3] 

O/P = 3 

Solution: Do a bitwise XOR of all the elements. Finally we get the number which has odd 
occurrences. This is because of the fact that, A XOR A = 0. 

Time Complexity: 0(n). 

Problem-58 Find the two repeating elements in a given array 

Given an array with n + 2 elements, all elements of the array are in range 1 to n and also all 
elements occur only once except two numbers which occur twice. Find those two repeating 
numbers. 

Example: 6, 2, 6, 5, 2, 3,1 and n = 5 

The above input hasn + 2 = 7 elements with all elements occurring once except 2 and 6 
which occur twice. So the output should be 6 2. 

Solution: One simple way to scan the complete array for each element of the input elements. That 
means use two loops. In the outer loop, select elements one by one and count the number of 
occurrences of the selected element in the inner loop. 
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void PrintRepeatedElements(int A[], int n) 

1 


} 


int i, j; 

for(i = 0; i < n; i++) 

for(j = i+1; j <n; j++) 

if(A[i] == A[j]) 

printf(“%d”, A[i]); 


Time Complexity: 0(n 2 ). 
Space Complexity: 0(1). 


Problem-59 For the Problem-58, can we improve the time complexity? 


Solution: Sort the array using any comparison sorting algorithm and see if there are any elements 
which contiguous with same value. 


Time Complexity: Ofnlogri). 

Space Complexity: 0(1). 

Problem-60 For the Problem-58, can we improve the time complexity? 

Solution: Use Count Array. This solution is like using a hash table. But for simplicity we can use 
array for storing the counts. Traverse the array once. While traversing, keep track of count of all 
elements in the array using a temp array count [] of size n, when we see an element whose count is 
already set, print it as duplicate. 


void PrintRepeatedElements(int A[], int n) 

1 

int “count = (int *)calloc(sizeof(int), (n - 2)); 
for(int i = 0; i < size; i++) 

1 

if(count[A[i]] == 1) 

printf(“%d”, A[i]); 

else 


} 


3 


count[A[i]]++; 


Time Complexity: 0(n). 

Space Complexity: 0(n). 

Problem-61 Consider the Problem-58. Let us assume that the numbers are in the range 1 to n. Is 
there any other way of solving the problem? 
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Solution: Using XOR Operation. Let the repeating numbers be X and Y, if we xor all the elements in 
the array and all integers from 1 to n, then the result is X XOR Y. 

The l’s in binary representation of X XOR Y is corresponding to the different bits between X and Y. 
Suppose that the k th bit of X XOR Y is 1, we can XOR all the elements in the array and all integers 
from 1 to n, whose k th bits are 1. The result will be one of X and Y. 

void PrintRepeatedElements (int A[], int size) 

1 

int XOR = A[0]; 
int right_most_set_bit_no; 
int n = size - 2; 
int X= 0, Y = 0; 


/* Compute XOR of all elements in A[]7 
for(int i = 0; i < n; i++) 

XOR A =A[i]; 


/* Compute XOR of all elements [1, 2 ,.n} 7 
for(i = 1; i <= n; i++) 

XOR A = i; 

I* Get the rightmost set bit in right_most_set_bit_no 7 
right_most_set_bit_no = XOR & ~(XOR -1); 

/* Now divide elements in two sets by comparing rightmost set 7 
for(i = 0; i < n; i++) 

1 

if(A[i] & right_most_set_bit_no) 

X = X A A[i]; /*XOR of first set in A[] 7 

else 

Y = Y A A[i]; /*XOR of second set inA[] 7 

3 

for(i = 1; i <= n; i++) 

1 

if(i & right_most_set_bit_no) 

X = X A i; /*XOR of first set in A[] and [1, 2, ...n }7 

else 

Y = Y A i; /*XOR of second set in A[] and [1, 2, ...n } 7 

3 

printf(“%d and %d”,X, Y); 
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Time Complexity: 0(n). 

Space Complexity: 0(1). 

Problem-62 Consider the Problem-58. Let us assume that the numbers are in the range 1 to n. Is 
there yet other way of solving the problem? 

Solution: We can solve this by creating two simple mathematical equations. Let us assume that two 
numbers which we are going to find are X and Y. We know the sum of n numbers is n(n + l)/2 and 
product is n\. Make two equations using these sum and product formulae, and get values of two 
unknowns using the two equations. 

Let summation of all numbers in array be S and product be P and the numbers which are being 
repeated are X and Y. 

X + Y = S - n(n + l)/2 
XY = P/n\ 

Using above two equations, we can find out X and Y. 

There can be addition and multiplication overflow problem with this approach. 

Time Complexity: 0(n). 

Space Complexity: 0(1). 

Problem-63 Similar to Problem-58. Let us assume that the numbers are in the range 1 to n. Also, 
n — 2 elements are repeating thrice and remaining two elements are repeating twice. Find the 
element which is repeating twice. 

Solution: If we xor all the elements in the array and all integers from 1 to n, then the all the elements 
which are trice will become zero This is because, since the element is repeating trice and XOR with 
another time from range makes that element appearing four times. As a result, output 
of a XOR a XOR a XOR a = 0. Same is case with all elements which repeated thrice. 

With the same logic, for the element which repeated twice, if we XOR the input elements and also 
the range, then the total number of appearances for that element is 3. As a result, output 
of a XOR a XOR a = a. Finally, we get the element which repeated twice. 

Time Complexity: 0(n). 

Space Complexity: 0(1). 

Problem-64 Separate Even and Odd numbers 

Given an array A[], write a function that segregates even and odd numbers. The functions should 
put all even numbers first, and then odd numbers. 

Example: 

Input = (12,34,45,9,8,90,3} 

Output = (12,34,90,8,9,45, 3} 
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In the output, order of numbers can be changed, i.e., in the above example 34 can come before 12 
and 3 can come before 9. 

Solution: The problem is very similar to Separate O’s and l’s (Problem-65^ in an array, and both of 
these problems are variation of famous Dutch national flag problem. 

Algorithm: Logic is little similar to Quick sort. 

1) Initialize two index variables left and right: left = 0, right = n — 1 

2) Keep incrementing left index until we see an odd number. 

3) Keep decrementing right index until we see an even number. 

4) If left < right then swap A[left] and A [right] 

Implementation: 

void DutchNationalFlag(int A[], int n) 

1 

/* Initialize left and right indexes 7 
int left = 0, right = n-1; 
while (left < right) 

1 

/* Increment left index while we see 0 at left 7 
while(A[left]%2 == 0 && left < right) 
left++; 

/* Decrement right index while we see 1 at right 7 
while(A[right]%2 == 1 && left < right) 
right—; 

if(left < right) 

[ 

/* Swap A[left] and A[right]7 
swap(&A[left], &A[right]); 
left++; 
right—; 

3 

} 

} 

Time Complexity: 0(n). 

Problem-65 Other way of asking Problem-64 but with little difference. 

Separate O’s and l’s in an array 
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We are given an array of O’s and l’s in random order. Separate O’s on left side and l’s on right side 
of the array. Traverse array only once. 

Input array = [0,1, 0,1,0, 0,1,1,1,0] 

Output array = [0, 0, 0, 0, 0,1,1, 1,1, 1] 

Solution: Counting O’s or l’s 

1. Count the number of O’s. Let count be C. 

2. Once we have count, we can put C O’s at the beginning and l’s at the remaining n- C positions 
in array. 

Time Complexity: 0(n). This solution scans the array two times. 

Problem-66 Can we solve the Problem-65 in once scan? 

Solution: Yes. Use two indexes to traverse: 

Maintain two indexes. Initialize first index left as 0 and second index right as n — 1. 

Do following while left < right-. 

1) Keep incrementing index left while there are 0s at it 

2) Keep decrementing index right while there are Is at it 

3) If left < right then exchange A[left] and A[right] 

/‘Function to put all 0s on left and all Is on right*/ 
void Separate0andl(int A[], int n) 

1 

/* Initialize left and right indexes V 
int left = 0, right = n-1; 
while (left < right) 

[ 

/* Increment left index while we see 0 at left V 
while(A[left] == 0 && left < right) 
left++; 

/* Decrement right index while we see 1 at right 7 
while(A[right] == 1 && left < right) 
right-; 

/* If left is smaller than right then there is a 1 at left 
and a 0 at right. Swap A[left] and A[right]7 
if(left < right) 

[ 
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A[left] = 0; 
A [right] = 1; 
left++; 
right-; 


] 


} 


] 


Time Complexity: 0(n). 

Space Complexity: 0(1). 

Problem-67 Maximum difference between two elements 

Given an array A 11 of integers, find out the difference between any two elements such that larger 
element appears after the smaller number in A []. 

Examples: If array is [2, 3,10, 6, 4, 8,1] then returned value should be 8 (Diff between 10 and 2). 
If array is [ 7, 9, 5, 6, 3,2 ] then returned value should be 2 (Difference between 7 and 9) 

Solution: Refer Divide and Conquer chapter. 

Problem-68 Given an array of 101 elements. Out of them 25 elements are repeated twice, 12 
elements are repeated 4 times and one element is repeated 3 times. Find the element which 
repeated 3 times in 0(1). 

Solution: Before solving this problem let us consider the following XOR operation property. 


aXORa = 0 


That means, if we apply the XOR on same elements then the result is 0. Let us apply this logic for this 
problem. 


Algorithm: 


• XOR all the elements of the given array and assume the result is A. 

• After this operation, 2 occurrences of number which appeared 3 times becomes 0 and one 

occurrence will remain. 

• The 12 elements which are appearing 4 times become 0. 

• The 25 elements which are appearing 2 times become 0. 

So just XOR’ing all the elements give the result. 

Time Complexity: 0(n), because we are doing only once scan. 

Space Complexity: 0(1). 
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Problem-69 Given an array A of n numbers. Find all pairs of X and Y in the array such that 
K = X * Y. Give an efficient algorithm without sorting. 

Solution: Create a hash table from the numbers that divide K. Divide K by a number and check for 
quotient in the table. 

Problem-70 Given a number n, give an algorithm for finding the number of trailing zeros in n\ . 

Solution: 

int NumberOfTrailingZerosInNumber(int n) 

1 

int i, count = 0; 
if (n < 0) 

return -1; 

for (i = 5; n / i > 0; i *= 5) 
count += n / i; 
return count; 

} 

Time Complexity: 0(logri), 
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